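El tratamiento de las erratas desde la sicolingüística y desde el procesamiento del lenguaje natural. Un estado de la cuestión [The treatment of typographical errors in psycholinguistics and natural language processing: a state of the art]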

  1. Santiago Rodríguez-Rubio 1
  2. Nuria Fernández-Quesada 1
  1. 1 Universidad Pablo de Olavide, Seville, Spain. ROR https://ror.org/02z749649

Journal:
Tonos digital: revista de estudios filológicos

ISSN: 1577-6921

Year of publication: 2021

Issue: 40

Type: Article

Abstract

Over time, the study of typographical errors has been approached from a variety of fields, such as bibliography, textual criticism, publishing, metalexicography, psycholinguistics and natural language processing (NLP). However, although typographical errors are inherent in written texts, there is no multidisciplinary, all-encompassing method for their treatment. For that reason, we present a theoretical review of typographical errors and their categorisation, especially from the perspective of psycholinguistics and NLP. On the one hand, some of the most thorough studies from the twentieth century onwards have been conducted in those two fields. On the other hand, the two fields show remarkable similarities, regarding both the conclusions reached (e.g. the low frequency of errors in the first letter of words) and the limitations encountered (e.g. ambiguity in error categorisation). Our starting point is therefore the theoretical distinction between motoric or execution errors (mistypings) and orthographic or cognitive errors (misspellings). Moreover, we analyse the commonalities between the two disciplines when it comes to identifying the wrong operations (omission, insertion, substitution, and transposition) behind those phenomena. Finally, we confirm that execution errors and competence errors sometimes cannot be distinguished in the practice of writing.
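The four wrong operations named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' method: the function name `classify_single_error` and the example words are hypothetical, and the sketch assumes that exactly one operation separates the intended word from the typed one.

```python
def classify_single_error(intended: str, typed: str) -> str:
    """Label the single operation that turns `intended` into `typed`.

    Hypothetical helper for illustration only. Returns "none", "omission",
    "insertion", "substitution", "transposition", or "other".
    """
    if intended == typed:
        return "none"
    n, m = len(intended), len(typed)
    # Find the first position where the two strings differ.
    i = 0
    while i < min(n, m) and intended[i] == typed[i]:
        i += 1
    # One character of the intended word is missing.
    if m == n - 1 and intended[i + 1:] == typed[i:]:
        return "omission"
    # One extra character was typed.
    if m == n + 1 and intended[i:] == typed[i + 1:]:
        return "insertion"
    if m == n:
        # Exactly one character was replaced by another.
        if intended[i + 1:] == typed[i + 1:]:
            return "substitution"
        # Two adjacent characters were swapped.
        if (i + 1 < n
                and intended[i] == typed[i + 1]
                and intended[i + 1] == typed[i]
                and intended[i + 2:] == typed[i + 2:]):
            return "transposition"
    return "other"


if __name__ == "__main__":
    for intended, typed in [("letter", "leter"), ("cat", "cats"),
                            ("word", "ward"), ("the", "teh")]:
        print(intended, typed, classify_single_error(intended, typed))
```

The label describes only the surface operation. Whether "leter" is a slip of the fingers or a genuine misspelling cannot be read off the string pair, which is the ambiguity the abstract points to.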

Bibliographic References

  • Arroyo, L. (2017). Lingüística de errores con fines computacionales. PatErr, un recurso para la revisión textual del español basado en patrones de error codificados. Doctoral thesis, Facultad de Traducción e Interpretación, Universidad de Las Palmas de Gran Canaria, Las Palmas de Gran Canaria, Spain.
  • Baba, Y. & Suzuki, H. (2012). How Are Spelling Errors Generated and Corrected? A Study of Corrected and Uncorrected Spelling Errors Using Keystroke Logs. In H. Li, C-Y. Lin, M. Osborne, G. G. Lee & J. C. Park (Eds.), Proceedings of the 50th Annual Meeting of the Association for Computational Linguistics, Vol. 2 (pp. 373-377). Jeju Island, Korea: Association for Computational Linguistics (ACL).
  • Ballard, T. (2008). Systematic Identification of Typographical Errors in Library Catalogs. Cataloging and Classification Quarterly, 46(1), 27-33.
  • Beall, J. (2005). Metadata and Data Quality Problems in the Digital Library. Journal of Digital Information, 6(3), 1-20.
  • Blair, C. R. (1960). A Program for Correcting Spelling Errors. Information and Control, 3(1), 60-67.
  • Blecua, A. (1983). Manual de crítica textual. Madrid: Editorial Castalia.
  • Damerau, F. J. (1964). A Technique for Computer Detection and Correction of Spelling Errors. Communications of the ACM, 7(3), 171-176.
  • Deorowicz, S. & Ciura, M. G. (2005). Correcting Spelling Errors by Modelling their Causes. International Journal of Applied Mathematics and Computer Science (AMCS), 15(2), 275-285.
  • Fernández-Ordóñez, I. (2002). Tras la collatio o cómo establecer correctamente el error textual. La corónica: A Journal of Medieval Spanish Language and Literature, 30(2), 105-180.
  • Grouin, C. (2008). Certification and cleaning up of a text corpus: towards an evaluation of the “grammatical” quality of a corpus. In N. Calzolari, Kh. Choukri, B. Maegaard, J. Mariani, J. Odijk, S. Piperidis & D. Tapias (Eds.), Proceedings of the 6th International Conference on Language Resources and Evaluation (LREC2008) (pp. 1083-1090). Marrakech, Morocco: European Language Resources Association (ELRA).
  • Grudin, J. T. (1983). Error Patterns in Novice and Skilled Transcription Typing. In W. E. Cooper (Ed.), Cognitive Aspects of Skilled Typewriting (pp. 121-143). New York: Springer.
  • Hartley, T. (2009). Technology and Translation. In J. Munday (Ed.), The Routledge Companion to Translation Studies (pp. 106-127). Abingdon/New York: Routledge.
  • Iamartino, G. (2017). Lexicography, or the Gentle Art of Making Mistakes. Altre Modernità (Numero speciale – Errors: Communication and its Discontents), 48-78.
  • Kano, A., Read, J. C., Dix, A. & MacKenzie, S. (2007). ExpECT: An Expanded Error Categorisation Method for Text Input. In L. J. Ball, M. A. Sasse, C. Sas, T. C. Ormerod, A. Dix, P. Bagnall & T. McEwan (Eds.), People and Computers XXI – HCI… but not as we know it. Proceedings of HCI 2007 (pp. 147-156). Swindon: British Computer Society.
  • Kano, A. & Read, J. C. (2009). Text Input Error Categorisation: Solving Character Level Insertion Ambiguities Using Zero Time Analysis. In A. F. Blackwell (Ed.), People and Computers XXIII – Celebrating People and Technology: HCI 2009 (pp. 293-302). Cambridge: British Computer Society.
  • Kordić, R. & Avilés, T. (2016). ¿Variante lingüística o error de copista? Hipogrifo, 4(1), 199-215.
  • Kukich, K. (1992). Techniques for Automatically Correcting Words in Text. ACM Computing Surveys, 24(4), 377-439.
  • Lashley, K. S. (1951). The Problem of Serial Order in Behavior. In L. A. Jeffress (Ed.), Cerebral mechanisms in behavior (pp. 112-146). New York: Wiley.
  • Lessenberry, D. D. (1928). Analysis of Errors. Syracuse/New York: L. C. Smith and Corona Typewriters, School Department.
  • Logan, F. A. (1999). Errors in Copy Typewriting. Journal of Experimental Psychology: Human Perception and Performance, 25(6), 1760-1773.
  • Luelsdorff, P. (1986). Constraints on Error Variables in Grammar: Bilingual Misspelling Orthographies. Philadelphia: John Benjamins.
  • MacKellar, T. (1893). The American Printer: A Manual of Typography. Philadelphia: MacKellar, Smiths & Jordan Foundry.
  • MacNeilage, P. F. (1964). Typing Errors as Clues to Serial Ordering Mechanisms in Language Behaviour. Language and Speech, 7(3), 144-159.
  • Martínez de Sousa, J. (2000). Las erratas. Centro Virtual Cervantes (Rinconete > Lengua). Retrieved 14 June 2020 from http://bit.ly/2v82eqn
  • Martínez de Sousa, J. (2005). Manual de edición y autoedición (2nd ed., fifth printing 2016). Madrid: Ediciones Pirámide.
  • Mendikoetxea, A., Murcia, S. & Rollinson, P. (2010). Focus on Errors: Learner Corpora as Pedagogical Tools. In M. C. Campoy-Cubillo, B. Bellés-Fortuño & M. Ll. Gea-Valor (Eds.), Corpus-Based Approaches to English Language Teaching (pp. 180-194). London/New York: Continuum.
  • Miller, G. A. & Friedman, E. A. (1957). The Reconstruction of Mutilated English Texts. Information and Control, 1, 38-55.
  • Min, K., Wilson, W. & Moon, Y-J. (2000). Typographical and Orthographical Spelling Error Correction. 2nd International Conference on Language Resources and Evaluation 221. Athens, Greece.
  • Mitton, R. (1987). Spelling Checkers, Spelling Correctors and the Misspellings of Poor Spellers. Information Processing and Management, 23(5), 495-505.
  • Mitton, R. (1996). English spelling and the computer. London: Birkbeck ePrints.
  • Moure, J. L. (2006). Errores deseables y erratas cohonestadas. Páginas de guarda: revista de lenguaje, edición y cultura escrita, 1, 11-25.
  • Mugglestone, L. (2005). Lost for Words: The Hidden History of the Oxford English Dictionary. New Haven/London: Yale University Press.
  • Norman, D. A. (1981). Categorization of Action Slips. Psychological Review, American Psychological Association, 88(1), 1-15.
  • Oldfield, A. (1890). A Practical Manual of Typography and Reference Book for Printers. London: E. Menken, Technical Publisher.
  • Oppenheim, G. M. (2009). The Little Voice in your Head: Error-based Investigations of Abstracted and Articulated Inner Speech. B. A. Thesis, Grinnell College, University of Illinois, USA.
  • Peterson, J. L. (1980). Computer Programs for Detecting and Correcting Spelling Errors. Communications of the ACM, 23(12), 676-687.
  • Peterson, J. L. (1986). A Note on Undetected Typing Errors. Communications of the ACM, 29(7), 633-637.
  • Pollock, J. J. & Zamora, A. (1984). Automatic Spelling Correction in Scientific and Scholarly Text. Communications of the ACM, 27(4), 358-368.
  • Ramírez, F. & López, E. (2006). Spelling Error Patterns in Spanish for Word Processing Applications. In 5th International Conference on Language Resources and Evaluation (pp. 93-98). Genoa, Italy: European Language Resources Association (ELRA).
  • Read, A. W. (1978). The Sources of Ghost Words in English. Word, 29(2), 95-104.
  • Ren, X. & Perrault, F. (1992). The Typology of Unknown Words: An Experimental Study of Two Corpora. In COLING '92 Proceedings of the 14th Conference on Computational Linguistics, Vol. 1 (pp. 408-414). Nantes, France: Association for Computational Linguistics (ACL).
  • Rumelhart, D. E. & Norman, D. A. (1982). Simulating a Skilled Typist: A Study of Skilled Cognitive-Motor Performance. Cognitive Science, 6, 1-36.
  • Salthouse, T. A. (1986). Perceptual, Cognitive, and Motoric Aspects of Transcription Typing. Psychological Bulletin, 99(3), 303-319.
  • Van Berkel, B. & De Smedt, K. (1988). Triphone Analysis: A Combined Method for the Correction of Orthographical and Typographical Errors. In Proceedings of the 2nd Conference on Applied Natural Language Processing (ANLP) (pp. 77-83). Austin, Texas, USA: Association for Computational Linguistics (ACL).
  • Véronis, J. (1988). Morphosyntactic correction in natural language interfaces. In J. Vargha (Ed.), COLING '88 Proceedings of the 12th Conference on Computational Linguistics, Vol. 2 (pp. 708-713). Budapest, Hungary: Association for Computational Linguistics (ACL).
  • Wells, F. L. (1916). On the Psychomotor Mechanisms of Typewriting. The American Journal of Psychology, 27(1), 47-70.
  • Wheatley, H. B. (1893). Literary Blunders: A Chapter in the “History of Human Error”. London: Elliot Stock.