TY  - JOUR
T1  - Gravity Spy: Lessons Learned and a Path Forward
JF  - European Physical Journal Plus
Y1  - 2024
A1  - Michael Zevin
A1  - Corey B. Jackson
A1  - Zoheyr Doctor
A1  - Yunan Wu
A1  - Carsten Østerlund
A1  - L. Clifton Johnson
A1  - Christopher P. L. Berry
A1  - Kevin Crowston
A1  - Scott B. Coughlin
A1  - Vicky Kalogera
A1  - Sharan Banagiri
A1  - Derek Davis
A1  - Jane Glanzer
A1  - Renzhi Hao
A1  - Aggelos K. Katsaggelos
A1  - Oli Patane
A1  - Jennifer Sanchez
A1  - Joshua Smith
A1  - Siddharth Soni
A1  - Laura Trouille
A1  - Marissa Walker
A1  - Irina Aerith
A1  - Wilfried Domainko
A1  - Victor-Georges Baranowski
A1  - Gerhard Niklasch
A1  - Barbara Téglás
AB  - <p>The Gravity Spy project aims to uncover the origins of glitches, transient bursts of noise that hamper analysis of gravitational-wave data. By using both the work of citizen-science volunteers and machine-learning algorithms, the Gravity Spy project enables reliable classification of glitches. Citizen science and machine learning are intrinsically coupled within the Gravity Spy framework, with machine-learning classifications providing a rapid first-pass classification of the dataset and enabling tiered volunteer training, and volunteer-based classifications verifying the machine classifications, bolstering the machine-learning training set and identifying new morphological classes of glitches. These classifications are now routinely used in studies characterizing the performance of the LIGO gravitational-wave detectors. Providing the volunteers with a training framework that teaches them to classify a wide range of glitches, as well as additional tools to aid their investigations of interesting glitches, empowers them to make discoveries of new classes of glitches. This demonstrates that, when giving suitable support, volunteers can go beyond simple classification tasks to identify new features in data at a level comparable to domain experts. The Gravity Spy project is now providing volunteers with more complicated data that includes auxiliary monitors of the detector to identify the root cause of glitches.</p>
VL  - 139
ER  - 

TY  - JOUR
T1  - Knowledge Tracing to Model Learning in Online Citizen Science Projects
JF  - IEEE Transactions on Learning Technologies
Y1  - 2020
A1  - Kevin Crowston
A1  - Carsten Østerlund
A1  - Tae Kyoung Lee
A1  - Corey Brian Jackson
A1  - Mahboobeh Harandi
A1  - Sarah Allen
A1  - Sara Bahaadini
A1  - Scott Coughlin
A1  - Aggelos Katsaggelos
A1  - Shane Larson
A1  - Neda Rohani
A1  - Joshua Smith
A1  - Laura Trouille
A1  - Michael Zevin
AB  - <p>We present the design of a citizen science system that uses machine learning to guide the presentation of image classification tasks to newcomers to help them more quickly learn how to do the task while still contributing to the work of the project. A Bayesian model for tracking volunteer learning for training with tasks with uncertain outcomes is presented and fit to data from 12,986 volunteer contributors. The model can be used both to estimate the ability of volunteers and to decide the classification of an image. A simulation of the model applied to volunteer promotion and image retirement suggests that the model requires fewer classifications than the current system.</p>
VL  - 13
ER  - 

TY  - JOUR
T1  - Teaching Citizen Scientists to Categorize Glitches using Machine-Learning-Guided Training
JF  - Computers in Human Behavior
Y1  - 2020
A1  - Corey Jackson
A1  - Carsten Østerlund
A1  - Kevin Crowston
A1  - Mahboobeh Harandi
A1  - Sarah Allen
A1  - Sara Bahaadini
A1  - Scott Coughlin
A1  - Vicky Kalogera
A1  - Aggelos Katsaggelos
A1  - Shane Larson
A1  - Neda Rohani
A1  - Joshua Smith
A1  - Laura Trouille
A1  - Michael Zevin
AB  - <p>Training users in online communities is important for making high performing contributors. However, several conundrums exists in choosing the most effective approaches to training users. For example, if it takes time to learn to do the task correctly, then the initial contributions may not be of high enough quality to be useful. We conducted an online field experiment where we recruited users (N = 386) in a web-based citizen-science project to evaluate the two training approaches. In one training regime, users received one-time training and were asked to learn and apply twenty classes to the data. In the other approach, users were gradually exposed to classes of data that were selected by trained machine learning algorithms as being members of particular classes. The results of our analysis revealed that the gradual training produced “high performing contributors”. In our comparison of the treatment and control groups we found users who experienced gradual training performed significantly better on the task (an average accuracy of 90% vs. 54%), contributed more work (an average of 228 vs. 121 classifications), and were retained in the project for a longer period of time (an average of 2.5 vs. 2 sessions). The results suggests online production communities seeking to train newcomers would benefit from training regimes that gradually introduce them to the work of the project using real tasks.</p>
VL  - 105
ER  - 

TY  - JOUR
T1  - Classifying the unknown: Discovering novel gravitational-wave detector glitches using similarity learning
JF  - Physical Review D
Y1  - 2019
A1  - Scott Coughlin
A1  - Sara Bahaadini
A1  - Neda Rohani
A1  - Michael Zevin
A1  - Patane, Oli
A1  - Mahboobeh Harandi
A1  - Corey Brian Jackson
A1  - Noroozi, V.
A1  - Sarah Allen
A1  - Areeda, J.
A1  - Coughlin, M.
A1  - Ruiz, P.
A1  - Berry, C. P. L.
A1  - Kevin Crowston
A1  - Aggelos Katsaggelos
A1  - Andrew Lundgren
A1  - Carsten Østerlund
A1  - Joshua Smith
A1  - Laura Trouille
A1  - Vicky Kalogera
AB  - <p>The observation of gravitational waves from compact binary coalescences by LIGO and Virgo has begun a new era in astronomy. A critical challenge in making detections is determining whether loud transient features in the data are caused by gravitational waves or by instrumental or environmental sources. The citizen-science project Gravity Spy has been demonstrated as an efficient infrastructure for classifying known types of noise transients (glitches) through a combination of data analysis performed by both citizen volunteers and machine learning. We present the next iteration of this project, using similarity indices to empower citizen scientists to create large data sets of unknown transients, which can then be used to facilitate supervised machine-learning characterization. This new evolution aims to alleviate a persistent challenge that plagues both citizen-science and instrumental detector work: the ability to build large samples of relatively rare events. Using two families of transient noise that appeared unexpectedly during LIGO's second observing run, we demonstrate the impact that the similarity indices could have had on finding these new glitch types in the Gravity Spy program.</p>
VL  - 99
IS  - 8
ER  - 

TY  - JOUR
T1  - Gravity Spy: Integrating Advanced LIGO Detector Characterization, Machine Learning, and Citizen Science
JF  - Classical and Quantum Gravity
Y1  - 2017
A1  - Michael Zevin
A1  - Scott Coughlin
A1  - Sara Bahaadini
A1  - Emre Besler
A1  - Neda Rohani
A1  - Sarah Allen
A1  - Miriam Cabero
A1  - Kevin Crowston
A1  - Aggelos Katsaggelos
A1  - Shane Larson
A1  - Tae Kyoung Lee
A1  - Chris Lintott
A1  - Tyson Littenberg
A1  - Andrew Lundgren
A1  - Carsten Oesterlund
A1  - Joshua Smith
A1  - Laura Trouille
A1  - Vicky Kalogera
VL  - 34
ER  -