Departamento de Informática
Browsing Departamento de Informática by Author "Albardeiro, Miguel Ângelo Serra"
- SocialNetCrawler: Online Social Network Crawler
  Publication. Pais, S.; Cordeiro, João; Martins, Ricardo; Albardeiro, Miguel Ângelo Serra
  The emergence and popularization of online social networks suddenly made available a large amount of data on social organization, interaction, and human behavior. All this information opens new perspectives and challenges for the study of social systems and is of interest to many fields. Although most online social networks are recent, a vast number of scientific papers have already been published on this topic, dealing with a broad range of analytical methods and applications. A tool capable of gathering tailored information from social networks can therefore help many researchers in their work, especially in the area of Natural Language Processing (NLP). Nowadays, social networks are precisely the everyday medium in which people most often use written language, so the ubiquitous crawling of social networks is of the utmost importance for researchers. Such a tool allows researchers to obtain the relevant information they need and to focus on what really matters, without losing time developing their own crawler. In this paper, we present an extensive analysis of the existing social networks and their APIs, and describe the conception and design of a social network crawler intended to help NLP researchers.
- Unsupervised and Language Independent Approach to Extremism and Collective Radicalization Understanding
  Publication. Albardeiro, Miguel Ângelo Serra; Pais, Sebastião Augusto Rodrigues Figueiredo; Cordeiro, João Paulo da Costa
  Increasingly, social media hosts groups organized to protest against something, and these groups often include members with extremist ideologies. Such cases are happening more often: groups are created to organize peaceful protests, and someone starts a topic using extremist language, sometimes leading to a radicalisation of the group. This research aims to create an approach that detects cases of extremism and collective radicalisation within social networks in an unsupervised and language-independent way. The methods used are the creation of a lexicon of extreme sentiment terms, named ExtremeSentiLex, and a classifier of extreme sentiment whose input is the extreme sentiment terms and the social network post. Both tools were developed using purely statistical natural language processing methods. To validate ExtremeSentiLex, it was applied through the extreme sentiment classifier to posts from a dataset already validated by the scientific community. For a comparative study, word embeddings were used to expand the first ExtremeSentiLex obtained, and a further test was performed in which ExtremeSentiLex was balanced and applied to a balanced polarity dataset. The content-level results of this research, which will be made available to the scientific community, are ExtremeSentiLex and several datasets that we evaluated for the presence of extreme sentiment. In the validation tests of ExtremeSentiLex, the precision in finding extreme sentiment at the correct polarity was very high. When word embeddings were applied, the results dropped. For the balanced ExtremeSentiLex and the balanced dataset, the results were very positive. It was concluded that our dataset is suitable for detecting extreme sentiment in text. Furthermore, it was found that ExtremeSentiLex could be improved with the help of linguistic and psychological experts. However, this investigation aimed to build it using purely statistical methods, and this goal was successfully achieved.