A Large-Scale Social Media Corpus for the Detection of Youth Depression (Project Note)
Social media is frequently used by youth to share their health and mental issues. Therefore, social media has become a major online resource to study the language used to express issues such as depression and self-harm which can help to identify individuals at risk of harm. Furthermore, depression and suicide are generally closely related especially that depression is the most common symptom associated with self-harm acts such as suicide. In this project, we propose to build a linguistically annotated corpus with the sentiment analysis in order to study the youth behavior through their social media discourse across the MENA region. We plan to create a large-scale dataset of users with self-reported depression messages. Several correlational analyses will be performed to understand the psycho-social-behaviors. We plan to annotate the collected corpus using a team of dedicated annotators from various Arabic countries. Moreover, we will use various natural language processing (NLP) tools and techniques to reveal the linguistic patterns and the sentiments expressed by these tweets. Finally, we will apply machine learning (ML) methods to build behavior prediction tools using the annotated corpus. We believe that the annotated corpus to will be a valuable resource to be used by linguists, sociologists, computer scientists, psychologists, policy makers, etc.
Other Information
Published in: Procedia Computer Science
License: http://creativecommons.org/licenses/by-nc-nd/4.0/
See article on publisher's website: https://dx.doi.org/10.1016/j.procs.2018.10.483
Funding
Open Access funding provided by the Qatar National Library.
History
Language
- English
Publisher
ElsevierPublication Year
- 2018
License statement
This Item is licensed under the Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.Institution affiliated with
- Hamad Bin Khalifa University
- College of Health and Life Sciences - HBKU