Computational linguistics

Short description: Use of computational tools for the study of linguistics

Computational linguistics is an interdisciplinary field concerned with the computational modelling of natural language, as well as the study of appropriate computational approaches to linguistic questions. In general, computational linguistics draws upon linguistics, computer science, artificial intelligence, mathematics, logic, philosophy, cognitive science, cognitive psychology, psycholinguistics, anthropology and neuroscience, among others.

Origins

The field overlapped with artificial intelligence since the efforts in the United States in the 1950s to use computers to automatically translate texts from foreign languages, particularly Russian scientific journals, into English.^[1] Since rule-based approaches were able to make arithmetic (systematic) calculations much faster and more accurately than humans, it was expected that lexicon, morphology, syntax and semantics can be learned using explicit rules, as well. After the failure of rule-based approaches, David Hays^[2] coined the term in order to distinguish the field from AI and co-founded both the Association for Computational Linguistics (ACL) and the International Committee on Computational Linguistics (ICCL) in the 1970s and 1980s. What started as an effort to translate between languages evolved into a much wider field of natural language processing.^[3]^[4]

Annotated corpora

In order to be able to meticulously study the English language, an annotated text corpus was much needed. The Penn Treebank^[5] was one of the most used corpora. It consisted of IBM computer manuals, transcribed telephone conversations, and other texts, together containing over 4.5 million words of American English, annotated using both part-of-speech tagging and syntactic bracketing.^[6]

Japanese sentence corpora were analyzed and a pattern of log-normality was found in relation to sentence length.^[7]

Modeling language acquisition

The fact that during language acquisition, children are largely only exposed to positive evidence,^[8] meaning that the only evidence for what is a correct form is provided, and no evidence for what is not correct,^[9] was a limitation for the models at the time because the now available deep learning models were not available in late 1980s.^[10]

It has been shown that languages can be learned with a combination of simple input presented incrementally as the child develops better memory and longer attention span,^[11] which explained the long period of language acquisition in human infants and children.^[11]

Robots have been used to test linguistic theories.^[12] Enabled to learn as children might, models were created based on an affordance model in which mappings between actions, perceptions, and effects were created and linked to spoken words. Crucially, these robots were able to acquire functioning word-to-meaning mappings without needing grammatical structure.

Using the Price equation and Pólya urn dynamics, researchers have created a system which not only predicts future linguistic evolution but also gives insight into the evolutionary history of modern-day languages.^[13]

Chomsky's theories

Attempts have been made to determine how an infant learns a "non-normal grammar" as theorized by Chomsky normal form without learning an "overgeneralized version" and "getting stuck".^[9]

References

↑ John Hutchins: Retrospect and prospect in computer-based translation. Proceedings of MT Summit VII, 1999, pp. 30–44.
↑ "Deceased members". http://nlp.shef.ac.uk/iccl/committee.html#deceased.
↑ Natural Language Processing by Liz Liddy, Eduard Hovy, Jimmy Lin, John Prager, Dragomir Radev, Lucy Vanderwende, Ralph Weischedel
↑ Arnold B. Barach: Translating Machine 1975: And the Changes To Come.
↑ Marcus, M.; Marcinkiewicz, M. (1993). "Building a large annotated corpus of English: The Penn Treebank". Computational Linguistics 19 (2): 313–330. https://www.aclweb.org/anthology/J/J93/J93-2004.pdf.
↑ Taylor, Ann (2003). "1". Treebanks. Spring Netherlands. pp. 5–22.
↑ Furuhashi, S.; Hayakawa, Y. (2012). "Lognormality of the Distribution of Japanese Sentence Lengths". Journal of the Physical Society of Japan 81 (3): 034004. doi:10.1143/JPSJ.81.034004. Bibcode: 2012JPSJ...81c4004F.
↑ Bowerman, M. (1988). The "no negative evidence" problem: How do children avoid constructing an overly general grammar. Explaining language universals.
↑ ^9.0 ^9.1 Braine, M.D.S. (1971). On two types of models of the internalization of grammars. In D.I. Slobin (Ed.), The ontogenesis of grammar: A theoretical perspective. New York: Academic Press.
↑ Powers, D.M.W. & Turk, C.C.R. (1989). Machine Learning of Natural Language. Springer-Verlag. ISBN:978-0-387-19557-5.
↑ ^11.0 ^11.1 Elman, Jeffrey L. (1993). "Learning and development in neural networks: The importance of starting small". Cognition 48 (1): 71–99. doi:10.1016/0010-0277(93)90058-4. PMID 8403835.
↑ Salvi, G.; Montesano, L.; Bernardino, A.; Santos-Victor, J. (2012). "Language bootstrapping: learning word meanings from the perception-action association". IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics 42 (3): 660–71. doi:10.1109/TSMCB.2011.2172420. PMID 22106152.
↑ Gong, T.; Shuai, L.; Tamariz, M.; Jäger, G. (2012). E. Scalas. ed. "Studying Language Change Using Price Equation and Pólya-urn Dynamics". PLOS ONE 7 (3): e33171. doi:10.1371/journal.pone.0033171. PMID 22427981. Bibcode: 2012PLoSO...733171G.

External links

0.00

(0 votes)

Original source: https://en.wikipedia.org/wiki/Computational linguistics. Read more

[1] John Hutchins: Retrospect and prospect in computer-based translation. Proceedings of MT Summit VII, 1999, pp. 30–44.

[2] "Deceased members". http://nlp.shef.ac.uk/iccl/committee.html#deceased.

[3] Natural Language Processing by Liz Liddy, Eduard Hovy, Jimmy Lin, John Prager, Dragomir Radev, Lucy Vanderwende, Ralph Weischedel

[4] Arnold B. Barach: Translating Machine 1975: And the Changes To Come.

[5] Marcus, M.; Marcinkiewicz, M. (1993). "Building a large annotated corpus of English: The Penn Treebank". Computational Linguistics 19 (2): 313–330. https://www.aclweb.org/anthology/J/J93/J93-2004.pdf.

[6] Taylor, Ann (2003). "1". Treebanks. Spring Netherlands. pp. 5–22.

[autogenerated3-7] Furuhashi, S.; Hayakawa, Y. (2012). "Lognormality of the Distribution of Japanese Sentence Lengths". Journal of the Physical Society of Japan 81 (3): 034004. doi:10.1143/JPSJ.81.034004. Bibcode: 2012JPSJ...81c4004F.

[8] Bowerman, M. (1988). The "no negative evidence" problem: How do children avoid constructing an overly general grammar. Explaining language universals.

[autogenerated1971-9] 9.0 ^9.1 Braine, M.D.S. (1971). On two types of models of the internalization of grammars. In D.I. Slobin (Ed.), The ontogenesis of grammar: A theoretical perspective. New York: Academic Press.

[powers1989-10] Powers, D.M.W. & Turk, C.C.R. (1989). Machine Learning of Natural Language. Springer-Verlag. ISBN:978-0-387-19557-5.

[autogenerated1993-11] 11.0 ^11.1 Elman, Jeffrey L. (1993). "Learning and development in neural networks: The importance of starting small". Cognition 48 (1): 71–99. doi:10.1016/0010-0277(93)90058-4. PMID 8403835.

[12] Salvi, G.; Montesano, L.; Bernardino, A.; Santos-Victor, J. (2012). "Language bootstrapping: learning word meanings from the perception-action association". IEEE Transactions on Systems, Man, and Cybernetics - Part B: Cybernetics 42 (3): 660–71. doi:10.1109/TSMCB.2011.2172420. PMID 22106152.

[13] Gong, T.; Shuai, L.; Tamariz, M.; Jäger, G. (2012). E. Scalas. ed. "Studying Language Change Using Price Equation and Pólya-urn Dynamics". PLOS ONE 7 (3): e33171. doi:10.1371/journal.pone.0033171. PMID 22427981. Bibcode: 2012PLoSO...733171G.

[1]

[2]

[3]

[4]

[5]

[6]

[7]

[8]

[9]

[10]

[11]

[12]

[13]

v t e Computer science
Note: This template roughly follows the 2012 ACM Computing Classification System.
Hardware	Printed circuit board Peripheral Integrated circuit Very Large Scale Integration Systems on Chip (SoCs) Energy consumption (Green computing) Electronic design automation Hardware acceleration
Computer systems organization	Computer architecture Embedded system Real-time computing Dependability
Networks	Network architecture Network protocol Network components Network scheduler Network performance evaluation Network service
Software organization	Interpreter Middleware Virtual machine Operating system Software quality
Software notations and tools	Programming paradigm Programming language Compiler Domain-specific language Modeling language Software framework Integrated development environment Software configuration management Software library Software repository
Software development	Software development process Requirements analysis Software design Software construction Software deployment Software maintenance Programming team Open-source model
Theory of computation	Model of computation Formal language Automata theory Computational complexity theory Logic Semantics
Algorithms	Algorithm design Analysis of algorithms Algorithmic efficiency Randomized algorithm Computational geometry
Mathematics of computing	Discrete mathematics Probability Statistics Mathematical software Information theory Mathematical analysis Numerical analysis
Information systems	Database management system Information storage systems Enterprise information system Social information systems Geographic information system Decision support system Process control system Multimedia information system Data mining Digital library Computing platform Digital marketing World Wide Web Information retrieval
Security	Cryptography Formal methods Security services Intrusion detection system Hardware security Network security Information security Application security
Human–computer interaction	Interaction design Social computing Ubiquitous computing Visualization Accessibility
Concurrency	Concurrent computing Parallel computing Distributed computing Multithreading Multiprocessing
Artificial intelligence	Natural language processing Knowledge representation and reasoning Computer vision Automated planning and scheduling Search methodology Control method Philosophy of artificial intelligence Distributed artificial intelligence
Machine learning	Supervised learning Unsupervised learning Reinforcement learning Multi-task learning Cross-validation
Graphics	Animation Rendering Image manipulation Graphics processing unit Mixed reality Virtual reality Image compression Solid modeling
Applied computing	E-commerce Enterprise software Computational mathematics Computational physics Computational chemistry Computational biology Computational social science Computational engineering Computational healthcare Digital art Electronic publishing Cyberwarfare Electronic voting Video games Word processing Operations research Educational technology Document management
Book Category Outline WikiProject Commons

Anonymous

Search

Computational linguistics

Namespaces

More

Page actions

Contents

Origins

Annotated corpora

Modeling language acquisition

Chomsky's theories

See also

References

Further reading

External links

Navigation

Navigation

Help

Translate

Wiki tools

Wiki tools

Anonymous

Search

Computational linguistics

Origins

Annotated corpora

Modeling language acquisition

Chomsky's theories

See also

References

Further reading

External links

Navigation

Wiki tools

Page tools

Other projects

Categories