Program/Learn how to model the words of your language in Wikidata
- Notes
- Format
- Keywords
- Wikidata
- lexicographic data
- minority languages
- underrepresented languages
- linguistics
- computational linguistics
- language technology
- When and where
- Sunday 22 July, 14:30
- Duration: 55 minutes
- Mexico City
-->
- Presenter(s)
- Topic area
- Relationship to theme
Abstract
Wikidata has been collecting and providing all people the free access to general-purpose data for several years. Now it is also covering the lexicographical data, i.e. data about words, phrases, and more. The goal of this endeavour is to enable all people to both document and actively access the very foundation of any piece of human knowledge: the language.
The data model has been designed on Wikidata for needs of of this work, and a lot effort has been put into making the data model flexible to address the sheer diversity and variety of human languages. That has also made the structure quite abstract, and it has been so far mostly documented with dominant Indo-European languages in mind. The goal of this session is to change this and to start filling the gaps, enabling the knowledge of all languages to represented equally and thoroughly in Wikidata.
The session will kick off with a brief introduction to the lexicographical data model of Wikidata, and then participants will proceed to discuss specificities of languages they speak, and collaboratively work on modelling those languages on Wikidata. Participants will together explore alternative ways to model particular language phenomena, and discover the most accurate approaches for specific examples and general practices.
No formal linguistic knowledge is required to participate in the session. Prior experience with Wikidata is not required. Speakers of all languages are welcome the session. The session will be mainly conducted in English. Session moderator will also be able to answer questions in French and German.
Outcomes
Participants of the session will learn how to model words, phrases and other elements of languages they speak, as well as other languages, on Wikidata. They will be able to explain and discuss topics related to lexicographical data on Wikidata in their own communities, allowing the knowledge of their languages to be documented and made available for all people. Attendees will understand how language data is structured on Wikidata, and will be empowered to create tools and other means of both contributing to the knowledge stored on Wikidata, and making the use of this knowledge for all people, inside Wikimedia world and beyond.
Documents
Signup
- Rehman 10:03, 24 June 2018 (UTC)
- Armineaghayan (talk) 25 June 2018
- <add your username here if you are interested in attending>