General Resources for Your Computational Linguistics Journey
NOTE: Your favourite resource does not appear in the list? You MUST let us know! :) Either send a pull request or contact annika.ott@student.uni-tuebingen.de to do it for you.
Conferences
-
CICLing: International Conference on Computational Linguistics and Intelligent Text Processing
-
EMNLP: Empirical Methods in Natural Language Processing
Data Collections & Corpora
- British National Corpus (BNC)
- 100 million words of text
- range of genres: spoken, fiction, magazines, newspapers, academic, etc.
- Corpus of Contemporary American English (COCA)
- 25+ million words each year, 1990-2019
- Linguistic Data Consortium (LDC)
- creation, collection and distribution of speech and text databases, lexicons, and other resources for linguistics research and development purposes
- Manifesto Project
- annotated collection of electoral programmes
- 100 parties from more than 50 countries from 1945 until today
- OPUS
- collection of translated texts from the web
Recommended Readings
Intro Level
Current Trends
Tools
- From Data to Viz
- helps you find the appropriate visualisation for your data
- Overleaf
- cloud-based LATEX editor to be used in your browser
Tutorials
Videos
Written Form
Other
- Diversity in Linguistics
- they offer a ‘Statistics for Linguistics’ course