CIRCSE

The Centro Interdisciplinare di Ricerche per la Computerizzazione dei Segni dell’Espressione[1] (CIRCSE) is an interdisciplinary research centre of the Università Cattolica del Sacro Cuore (Milan, Italy). Its expertise lies in computational linguistics, natural language processing, information extraction and philosophy of language, with a particular focus on ancient and historical texts.

The CIRCSE has received a number of research awards, including grants from the Marie Skłodowska-Curie Actions (no. 658332-WFL, Word Formation Latin Lexicon[2]) and from the European Research Council (no. 769994, LiLa: Linking Latin).

History

The CIRCSE was founded in 2009 by Marco Passarotti and Savina Raynaud as the formal embodiment of the former GIRCSE (Gruppo Interdisciplinare di Ricerche per la Computerizzazione dei Segni dell'Espressione), a research group built around Roberto Busa SJ, a pioneer in the use of computers for linguistic and literary analysis and author of the Index Thomisticus, and Chiara Colombo. The GIRCSE's active involvement in the humanities computing workshops held at the Università Cattolica since 1978 makes it one of the first computational linguistics research groups founded in Italy. The group was supported locally by the late director of the Institute of Glottology, Prof. Giancarlo Bolognesi, and its members included Paolo Branca, Paola Pontani and Alberto Cordone.

Linguistic Resources

The CIRCSE has produced the following resources:

  • Index Thomisticus Treebank (IT-TB)[3], available in its original version[4] and in Universal Dependencies format.[5]
  • Latin-VALLEX[6][7]
  • IT-VaLex[8][9]
  • Latin Affectus Lexicon[10][11]
  • Latin Word Embeddings[12][13]
  • Version 3.0 of the LEMLAT morphological analyser for Latin[14][15]
  • Word Formation Latin Lexicon (WFL)[16][17]

References

  1. "CIRCSE website" (in it-it). https://centridiricerca.unicatt.it/circse-centro-interdisciplinare-di-ricerche-per-la-computerizzazione-dei-segni-dell-il-centro-di-ricerca. 
  2. "Word Formation Latin". http://wfl.marginalia.it/. 
  3. Passarotti, Marco Carlo (2019). The Project of the Index Thomisticus Treebank. 10. De Gruyter. ISBN 978-3-11-059678-6. https://publicatt.unicatt.it/handle/10807/141133. 
  4. "Index Thomisticus Treebank". https://itreebank.marginalia.it/. 
  5. "UD_Latin-ITTB". https://universaldependencies.org/treebanks/la_ittb/index.html. 
  6. Passarotti, Marco Carlo; Gonzalez Saavedra, Berta; Onambele Manga, Christophe Ledoux (2016). Latin Vallex. A Treebank-based Semantic Valency Lexicon for Latin. European Language Resources Association (ELRA). pp. 2599–2606. ISBN 978-2-9517408-9-1. https://publicatt.unicatt.it/handle/10807/78696. 
  7. GitHub CIRCSE/Latin-VALLEX, CIRCSE Research Centre, 2019-01-25, https://github.com/CIRCSE/Latin-VALLEX, retrieved 2020-04-11 
  8. McGillivray, Barbara; Passarotti, Marco Carlo (2009). The Development of the Index Thomisticus Treebank Valency Lexicon. EACL. pp. 1–8. ISBN 978-1-61738-560-5. https://publicatt.unicatt.it/handle/10807/1418. 
  9. GitHub CIRCSE/ITVALEX, CIRCSE Research Centre, 2019-05-18, https://github.com/CIRCSE/ITVALEX, retrieved 2020-04-11 
  10. Sprugnoli, Rachele; Passarotti, Marco; Corbetta, Daniela; Peverelli, Andrea (2020). Odi et Amo. Creating, Evaluating and Extending Sentiment Lexicons for Latin. European Language Resources Association (ELRA). pp. 3078–3086. doi:10.5281/zenodo.3862149. ISBN 979-10-95546-34-4. https://publicatt.unicatt.it/handle/10807/154884. 
  11. GitHub CIRCSE/Latin_Sentiment_Lexicons, CIRCSE Research Centre, 2020-03-02, https://github.com/CIRCSE/Latin_Sentiment_Lexicons, retrieved 2020-04-11 
  12. Sprugnoli, Rachele; Passarotti, Marco; Moretti, Giovanni (2019). Vir is to Moderatus as Mulier is to Intemperans. Lemma Embeddings for Latin. Accademia University Press. pp. 1–7. doi:10.5281/zenodo.3565572. ISBN 979-12-80136-00-8. https://publicatt.unicatt.it/handle/10807/144302. 
  13. "Lemma embeddings for Latin - vir is to moderatus as mulier is to intemperans". https://embeddings.lila-erc.eu/. 
  14. Passarotti, Marco Carlo; Budassi, Marco; Litta, Eleonora Maria; Ruffolo, Paolo (2017). The Lemlat 3.0 Package for Morphological Analysis of Latin. Northern European Association for Language Technology (NEALT). ISBN 978-91-7685-503-4. https://publicatt.unicatt.it/handle/10807/100472. 
  15. GitHub CIRCSE/LEMLAT3, CIRCSE Research Centre, 2020-03-30, https://github.com/CIRCSE/LEMLAT3, retrieved 2020-04-11 
  16. Litta, Eleonora (2018). Morphology Beyond Inflection. Building a Word Formation Based Lexicon for Latin. Cambridge Scholars Publishing. pp. 97–114. ISBN 978-1-5275-0803-3. https://publicatt.unicatt.it/handle/10807/130504. 
  17. GitHub CIRCSE/WFL, CIRCSE Research Centre, 2019-01-29, https://github.com/CIRCSE/WFL, retrieved 2020-04-11