ICWSM-16 - Program - Tutorial (original) (raw)

Tutorial Program

ICWSM-16 Tutorial Schedule

Tuesday, May 17

MORNING

8.30 AM -10.30 AM

TA1: Mining Smartphone Mobility Data
Spiros Papadimitriou, Tina Eliassi-Rad
Room Gereon, Maternushaus

8.30 AM -12.30 PM

TA2: Psychology for Computer Scientists: Fundamental Orientation and Frameworks
Johannes Eichstaedt, Michal Kosinski
Room Lambertus, Maternushaus

8.30 AM -12.30 PM

TA3: Using Crowdsourcing E ffectively for Social Media Research
Ujwal Gadiraju, Gianluca Demartini, Djellel Eddine Difallah, Michele Catasta
Room A212, GESIS

AFTERNOON

1:30 PM - 6:00 PM

TP4: Critical Review of Online Social Data: Limitations, Ethical Challenges, and Current Solutions
Carlos Castillo, Fernando Diaz, Emre Kiciman, Alexandra Olteanu
Room Lambertus, Maternushaus

1:30 PM - 6:00 PM

TP5: The Lifecycle of Geotagged Social Media Data
Rossano Schifanella, Bart Thomee
Training room, GESIS

MORNING AND AFTERNOON

11.00 AM - 12.30 PM + 1:30 PM - 5:30 PM

TAP6: The Web of Cities and Mobility
Bruno Goncalves, Anastasios Noulas, Konstantinos Pelechrinis, Daniele Quercia
Room Gereon, Maternushaus

The details of each tutorial can be found below:

TA1: Mining Smartphone Mobility Data

The recent availability of reasonably fast wireless and mobile data networks has spurred demand for more capable mobile computing devices. Conversely, the emergence of new devices has created further demand for better networks, creating an innovation cycle. The current concept of a smartphone as an always-connected computing device with multiple sensing modalities was brought into the mainstream by the Apple iPhone less than a decade ago. Such devices are now seeing an explosive growth. Additionally, for many people in the world, such devices are the first computers they use. Furthermore, small, cheap, always-connected devices with broad, extensible sensing capabilities are very recently emerging (e.g., standalone or peripheral), further blurring the lines between the physical and virtual worlds. All of this opens up countless possibilities for data collection and analysis across a broad range of applications. In this tutorial, we survey the state-of-the-art in terms of mining smartphone mobility data across different application areas. Our tutorial consists of three parts. First, we discuss the possibilities and challenges in the collection of data from various sensing modalities on smartphones. Second, we present cross-cutting challenges and algorithms in sensing and localization. Third, we cover a broad classes of applications--notably mobile health, location-based social networks, and mobile advertising. We conclude by showcasing the opportunities for new data collection techniques and new data-mining methods to meet the challenges and applications that are unique to the smartphone mobile arena.

Organizers

TA2: Psychology for Computer Scientists: Fundamental Orientation and Frameworks

Most researchers who use social media are in the business of understanding behavior, and often at scales unbeknownst to traditional psychology. In all likelihood some of the great future breakthroughs in psychology will be discovered by computational scientists. Yet most computational researchers have limited training in the social sciences. We believe that a basic understanding of psychological processes would aid in both understanding findings, as well as identifying striking results that might of interest to the social science community at large.

Fortunately, across the last 50 years, a number of influential theories have emerged in psychology that have helped organize the field tremendously and that set benchmarks for how individuals behave in a given context. The more computational scientists are familiar with these basic frameworks, the easier it will be to identify relevant points of contact with the existing literature.

This tutorial will review some of the currently most widely accepted fundamental theories of psychology, starting with basic brain function that span needs, lifespan and moral development, dimensional theories of emotion and interpersonal interaction, states vs. traits, personality models, and psychological assessment. Our hope is that this tutorial will leave the audience with a foundational orientation in psychology, to aid in the integration of computational findings with psychological theory.

Organizers

TA3: How to Use Crowdsourcing Effectively for Social Media Research

Since the term crowdsourcing was coined in 2005, we have witnessed a surge in the adoption of the crowdsourcing paradigm. Crowdsourcing solutions are highly sought-after to solve problems that require human intelligence at a large scale. In the last decade there have been numerous applications of crowdsourcing spanning several domains in both research and for practical benefits across disciplines (from sociology to computer science). In the realm of research practice, crowdsourcing has unmistakably broken the barriers of qualitative and quantitative studies by providing a means to scale-up previously constrained laboratory studies and controlled experiments. Today, one can easily build ground truths for evaluation, access potential participants around the clock with diverse demographics at will, and all within an unprecedentedly short amount of time. This also comes with a number of challenges related to lack of control on research subjects and to data quality.

In this tutorial, we will introduce the crowdsourcing paradigm in its entirety. We will discuss altruistic and reward-based crowdsourcing, eclipsing the needs of task requesters as well as the behavior of crowd workers. The tutorial will focus on paid microtask crowdsourcing, and reflect on the challenges and opportunities that confront us. In an interactive demonstration session, we will run the audience through the entire life-cycle of creating and deploying microtasks on an established crowdsourcing platform, optimizing task settings in order to meet task needs, and aggregating results thereafter. We will present a selection of state-of-the-art methods to ensure high-quality results and inhibit malicious activity. The tutorial will be framed within the context of Social Media. The human element at the core of all Social Media breeds a rich ground for crowdsourcing, and we aim to spread the virtues of this growing field.

Organizers

TP4: A Critical Review of Online Social Data: Limitations, Ethical Challenges, and Current Solutions

This tutorial aims to carefully and critically scrutinize the use of online social datasets for research, against a variety of possible data, methodological, and ethical pitfalls, by systematically overviewing prior work that identifies, quantifies and provides solutions to them.

To set the context, we will first provide examples of typical limitations, trade-offs or mistakes in current research aims and practices. Then, we will scrutinize the representativeness of social datasets, covering major classes of data biases including population, behavioral, and collection biases, as well as other quality issues such as data decay and temporal variations. Particular attention will be given to issues related to the design and evaluation of methods for collecting or processing social datasets. Finally, we cover various ethical caveats such as algorithmic reinforcement of discriminatory treatment and existing prejudice, and the risk of privacy breaches.

The tutorial will also include two hands-on sessions, where participants will have the opportunity to explore and debate about different types of data biases and effects of design decisions, and to jointly evaluate example research projects given by the tutors. Real-world datasets and code templates will be provided.

Organizers

TP5: The Lifecycle of Geotagged Social Media Data

In this tutorial we cover the four stages that are part of the lifecycle of geotagged social media data in research, namely representing, processing, analyzing, and visualizing. The tutorial aims to arm participants with both theoretical and practical knowledge about how to make sense of geospatial data for use in applications that range from computational social science and social media analysis to behavioral studies on digital platforms. We provide the basics on how to obtain, represent and combine different spatial data sources, with an accent on how to efficiently store, index and query a location-based dataset. We further discuss the main techniques on how to derive insights from spatial data, how to avoid common pitfalls and how to exploit social media (e.g. user interests, user movements) for the purpose of gaining a deeper understanding of the phenomenon under study. The tutorial will end with an overview of the main libraries and paradigms to build interactive and dynamic visualizations of geographical data on a map.

Organizers

TAP6: The Web of Cities and Mobility

Mining spatial data has been a core subject of study in the data mining community over the past years. Most of scholarly research has focused on the analysis of GPS traces and place recommendations. More recently however, new layers (e.g. social, semantic, linguistic) of big location data have emerged. Given the unprecedented levels of urbanization experienced in the last decade, among the most challenging and crucial ones is the urban fabric layer. The latter includes information that ranges from data related to transportation and navigation in a city to data that are related with the local economy. To integrate urban studies with the research agendas revolving around traditional data mining conferences, it has become clear that a basic introduction to urban studies is needed. The goal of this tutorial is twofold; (a) to provide this introduction in a form that is focused on topics most relevant to the ICWSM community and, (b) to introduce its attendants to the state-of-the-art in the analysis and modeling in this new regime of spatial data with a special focus on urban applications.

Organizers