- 1. THE CORPUS
- 2. USING THE CORPUS
- 3. PROJECT/PUBLICATIONS
This is an old revision of the document!
After the Collecting the data, we had around 650 chats in different languages but no idea which chat was in which language. Furthermore, we had given a promise to anonymize the data but we did not have a tool to browse the data in the available format. Thus, before making the data available to the research team, we had to pre-process them.