TuLeD (Tupían Lexical Database) is being compiled within the CrossLingference project. It is the first part of TuLaR (Tupían language resources) to be published online. It offers a comprehensive list of concepts comprising both Swadesh and Tupían culturally relevant items for all branches of the language family. Semantic fields for the concepts are taken from The World Loanword Database (WOLD) , and each concept is linked to a Concepticon id. The data is presented in CLDF format and the transcription follows the CLTS. The data is intended to be used in historical and computational linguistics working on language evolution and language contact.
The current pre-release (v0.10) includes 417 concepts across 79 languages, living and extinct, with a coverage ranging up to 98%. Cognacy assignent was first done automatically, and is now been improved through expert judgement.
The content of this web site is published under a Creative Commons Licence. We invite the community of users to think about further applications for the available data and look forward to your comments, feedback and questions.
Gerardi, F.F., Reichert, S.& Aragon, C. . TuLeD (Tupían lexical database): introducing a database of a South American language family. Lang Resources& Evaluation (2021). https://doi.org/10.1007/s10579-020-09521-5
TuLeD is supported by European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (Grant agreement No. 834050).