• Aucun résultat trouvé

Computational Social Science and Microblogs — The Good, the Bad and the Ugly

N/A
N/A
Protected

Academic year: 2022

Partager "Computational Social Science and Microblogs — The Good, the Bad and the Ugly"

Copied!
1
0
0

Texte intégral

(1)

Computational Social Science and Microblogs

The good, the bad and the ugly

Markus Strohmaier

GESIS & University of Koblenz Unter Sachsenhausen 6-8 50667 Cologne, Germany

markus.strohmaier@gesis.org ABSTRACT

According to the Computational Social Science Society of the Americas (CSSSA), computational social science is “The science that investigates social phenomena through the medium of computing and related advanced information processing technologies”. Positioned between the computer and so- cial sciences, this new and emerging interdisciplinary field is fuelled by at least the following two developments: (i) availability of data: With the web, a huge volume of social data is now available which enables the study of traces of social interactions on new scales. (ii)increasing quantifica- tion of social theories: With recent advances in the social sciences, social theories become increasingly formal and/or mathematical and thus amenable to quantification. Taken together, these two developments give rise to a whole range of new and interesting problems on the intersection between computer and social sciences. While a multitude of social data is available on the World Wide Web, microblogs are of particular interest due to their real-time nature, their rich social fabric and their presumed on/offline coupling. In this talk, I am going to talk about the potentials and the chal- lenges of doing computational social science based on data obtained from microblogs such as Twitter. In particular, I want to present previous work by my group and others to identify research avenues where progress has already been made or where progress is on the horizon, and contrast these with what I feel are open research challenges in this emerging field. Work that demonstrates the potential of microblogs for computational social science includes for example [1], where we have operationalized a number of theoretical con- structs from sociology to characterize the nature of online conversational practices of political parties on Twitter. In another work, we have studied the ways in which users’ fields of expertise can be inferred from microblog data [4]. Work that demonstrates the pitfalls and challenges of doing com- putational social science with microblog data include for ex- ample [5] where we have studied a network of bots who are competing against each other in attacking users on Twitter.

In subsequent work, we have found that such attacks have

the potential to impact the social graph of Twitter [3], i.e.

the network of who follows whom respectively who replies to whom. In other work, [2] have shown that there is a stark difference between the demographics of Twitter and the general population of the US, finding that Twitter users significantly over-represent densely populated regions and are predominantly male. I will argue that these and other factors need to be considered when we aim to unlock the full potential of microblog data for computational social science purposes.

Categories and Subject Descriptors

J.4 [Social and behavioral sciences]: Sociology; I.6.0 [Simulation and Modeling]: General

General Terms

Experimentation, Human Factors

Keywords

Social data, computational social science, social behavior, web science, online social networks

1. REFERENCES

[1] H. Lietz, C. Wagner, A. Bleier, and M. Strohmaier.

When politicians talk: Assessing online conversational practices of political parties on twitter. InInternational AAAI Conference on Weblogs and Social Media (ICWSM2014), Ann Arbor, MI, USA, June 2-4, 2014.

[2] A. Mislove, S. Lehmann, Y.-Y. Ahn, J.-P. Onnela, and J. N. Rosenquist. Understanding the demographics of twitter users. InInternational AAAI Conference on Weblogs and Social Media (ICWSM2011), Barcelona, Spain, July 17-21, 2011.

[3] S. Mitter, C. Wagner, and M. Strohmaier.

Understanding the impact of socialbot attacks in online social networks. InACM Web Science 2013, May 2-4th, Paris, France, 2013. (Extended Abstract).

[4] C. Wagner, V. Liao, P. Pirolli, L. Nelson, and M. Strohmaier. It’s not in their tweets: Modeling topical expertise of twitter users. InThe 4th IEEE International Conference on Social Computing

(SocialCom 2012), Amsterdam, The Netherlands, 2012.

[5] C. Wagner, S. Mitter, C. Koerner, and M. Strohmaier.

When social bots attack: Modeling susceptibility of users in online social networks. InProceedings of the WWW’12 Workshop on ’Making Sense of Microposts’

(MSM2012). CEUR-WS, 2012.

Copyright c2014 held by author(s)/owner(s); copying permitted only for private and academic purposes.

Published as part of the #Microposts2014 Workshop proceedings, available online as CEUR Vol-1141 (http://ceur-ws.org/Vol-1141)

#Microposts2014, April 7th, 2014, Seoul, Korea.

·#Microposts2014·4th Workshop on Making Sense of Microposts·@WWW2014 1

Références

Documents relatifs

The classification from the ANN seems to give better results than the other methods. This is expected from supervised meth- ods where the training is performed on pre-defined class

After a hiatus of about 15 years, preprint platforms have become popular in many disciplines (e.g., bioRxiv for biological sciences) due to the increasing drive towards

IV-A could be used at least when T is Base (defined in Sec.II-B). Following the same scheme as in Sec. IV-A, we want to add all Coq functions from natural numbers to closed terms,

byas na ’ong zer ba la rwa chen gyis khyod su yin / chos ci shes gsungs pas / kho na re / nga mu stegs purṇa nag po yin / chos rig byed bzhi la sogs pa shes zer bas / rwa chen

In particular, this article analyses the Dansk Industri judgment of the Danish Supreme Court, where the Danish court decided not to apply a preliminary ruling of the Court

The mathematicians’ Digital mathematics library (DML), which is not to be confused with libraries of mathematical objects repre- sented in some digital format, is the generous idea

The five “exceptional cases” are the already mentioned CRAS, handled by Gallica, two Elsevier journals whose titles are not currently owned by an academic institution (which are

To obtain the original data, as a first step it was decided to write software that allows access to the social network through special API and obtain the necessary data from it,