Blog

VOSON R tools for social media data collection and network analysis

The Virtual Observatory for the Study of Online Networks (VOSON) Lab in the School of Sociology at the ANU was formally established as part of a 2005 Australian Research Council Special Research Initiative (e-Research) grant. One of the activities of the VOSON Lab is the development of open-source tools that incorporate social science methods (in particular, Social Network Analysis) for collection and analysis of online behaviour. 

vosonSML (available on CRAN and GitHub) is an R package for the collection of social media data (Twitter, YouTube and Reddit) and the creation of various social networks for analysis.

vosonSML GitHub page: https://github.com/vosonlab/vosonSML 
vosonSML vignette : https://vosonlab.github.io/vosonSML/articles/Intro-to-vosonSML.html 

VOSON Dashboard (available on CRAN and GitHub) is an interactive R shiny app that integrates social media data collection (via vosonSML),  network visualisation and analysis tools. 

VOSON Dashboard GitHub page: https://github.com/vosonlab/VOSONDash 
VOSON Dashboard userguide: https://vosonlab.github.io/VOSONDashDocs/

Open source tools for social network collection and analysis

Gephi  

https://gephi.org

Gephi is an open-source tool for network visualisation and analysis. It provides features to visually manipulate a network. 

Citation: Bastian M., Heymann S., Jacomy M. (2009). Gephi: an open-source software for exploring and manipulating networks. International AAAI Conference on Weblogs and Social Media. 

NetworkX/Python 

https://networkx.org/documentation/stable/index.html

NetworkX is a Python package for the creation, manipulation, and study of the structure, dynamics, and functions of complex networks. 

Citation: Aric A. Hagberg, Daniel A. Schult and Pieter J. Swart, “Exploring network structure, dynamics, and function using NetworkX”, in Proceedings of the 7th Python in Science Conference (SciPy2008), Gäel Varoquaux, Travis Vaught, and Jarrod Millman (Eds), (Pasadena, CA USA), pp. 11–15, Aug 2008 

NodeXL 

https://www.smrfoundation.org/nodexl/

NodeXL Basic (free version) is an add-in for Microsoft Excel that supports social network and content analysis.   

Citation: Smith, M., Ceni A., Milic-Frayling, N., Shneiderman, B., Mendes Rodrigues, E., Leskovec, J., Dunne, C., (2010). NodeXL: a free and open network overview, discovery and exploration add-in for Excel 2007/2010/2013/2016, from the Social Media Research Foundation.

R/igraph 

https://igraph.org

igraph is an open-source package which integrates network analysis tools (network metrics and visualisation). igraph can be programmed in Pyhton, Mathematica and C/C++. 
 
Manual pages: https://igraph.org/r/doc/ 

Citation: Csardi G, Nepusz T (2006). “The igraph software package for complex network research.” InterJournal, Complex Systems, 1695.

R/vosonSML and R/VOSONDash

vosonSML (available on CRAN and GitHub) is an R package for the collection of social media data (Twitter, YouTube and Reddit) and the creation of various social networks for analysis. VOSON Dashboard (available on CRAN and GitHub) is an interactive R shiny app that integrates social media data collection (via vosonSML),  network visualisation and analysis tools. 

vosonSML and VOSONDash are developed at the VOSON Lab, School of Sociology, ANU.

vosonSML GitHub page: https://github.com/vosonlab/vosonSML 
vosonSML vignette : https://vosonlab.github.io/vosonSML/articles/Intro-to-vosonSML.html 

VOSON Dashboard GitHub page: https://github.com/vosonlab/VOSONDash 
VOSON Dashboard userguide: https://vosonlab.github.io/VOSONDashDocs/

R/statnet

http://statnet.org/

statnet is a suite of R packages for the management, exploration, statistical analysis, simulation and vizualization of network data.

Citation: Statnet Development Team, (Pavel N. Krivitsky, Mark S. Handcock, David R. Hunter, Carter T. Butts, Chad Klumb, Steven M. Goodreau, and Martina Morris) (2003-2020).
statnet: Software tools for the Statistical Modeling of Network Data.

ReDesigning Research: Ethics and reframing protocols to be COVID-safe

Fieldwork Futures Seminar 4

In response to the ANU College of Arts and Social Sciences research fieldwork COVID-19 check-in survey, a series of seminars have been designed to support fieldwork researchers in this difficult time. These offer forums for discussion, ideas for redirecting research or modifying methodologies, and opportunities for networking.

Speakers:

Dr. Caroline Schuster, School of Archaeology and Anthropology, RSHA

Prof. Bruce Smyth, Centre for Social Research Methods, RSSS

Chair: 

Dr. Carmel O’Shannessy, School of Literature, Languages and Linguistics, RSHA

ReDesigning Research: Exploring methodologies 2

Fieldwork Futures Seminar 3

Tuesday 20 October 2020, 12.30-1.30pm

In response to the ANU College of Arts and Social Sciences research fieldwork COVID-19 check-in survey, a series of seminars have been designed to support fieldwork researchers in this difficult time. These offer forums for discussion, ideas for redirecting research or modifying methodologies, and opportunities for networking.

Speakers 

ReDesigning Research: Working with community field researchers in a COVID context

Fieldwork Futures Seminar 2

Thurs 15 October 2020, 12-1pm

In response to the ANU College of Arts and Social Sciences research fieldwork COVID-19 check-in survey, a series of seminars have been designed to support fieldwork researchers in this difficult time. These offer forums for discussion, ideas for redirecting research or modifying methodologies, and opportunities for networking.

Speakers 

ReDesigning Research: Exploring methodologies 1

Fieldwork Futures Seminar 1

Wednesday 7 October 2020, 12-1pm

In response to the ANU College of Arts and Social Sciences research fieldwork COVID-19 check-in survey, a series of seminars have been designed to support fieldwork researchers in this difficult time. These offer forums for discussion, ideas for redirecting research or modifying methodologies, and opportunities for networking.

Speakers 

Datasets of languages data

Many language corpora are available for researchers. Each one contains details of levels of access. Many of the corpora have associated corpus tools to use for analysis. 

A collection of hundreds of data sets of spoken language, in over 34 languages. Includes adult and child language, conversation and more formal genres, and multilingual interactions. All transcribed. Carnegie Mellon University. 

The Language Archive. An archive of audio and video spoken language from around the world, including many languages with small numbers of speakers. Includes naturalistic and elicited production recordings.  Max Planck Institute for Psycholinguistics in Nijmegen

Endangered Languages Archive. Contains audio and video recordings of many endangered languages around the world. SOAS University of London

Spoken language corpora from languages around the world. Often transcribed and with English glosses. ARC Centre of Excellence for the Dynamics of Language. 

An archive of spoken language from many small languages around the world. Include digitizations of data recorded in analog formats. Over 1,200 languages represented.

Scripture resources in Australian Indigenous languages, including full texts of Bibles. 

https://www.english-corpora.org

Spoken and written corpora in English in many different genres. Includes American and British varieties of English. 

https://www.corpusdelespanol.org

Corpus of historical and contemporary Spanish in many genres as spoken in many different countries.

https://slaap.chass.ncsu.edu

An interactive web-based archive of sociolinguistic recordings, with integrated media playing and annotation features, plus other corpus tools. North Carolina State University

https://buckeyecorpus.osu.edu

Conversational speech recordings from 40 speakers in Columbus, Ohio conversing freely with an interviewer. Transcribed. 

Contains 500 ready-to-use written text corpora in over 90 languages. Includes corpus tools.

https://multicast.aspra.uni-bamberg.de

A collection of annotated texts from a typologically diverse array of languages. Time-aligned annotations with audio recordings, in formats suitable for cross-corpus typological research.

http://doreco.info/

Collection of spoken language corpora from about 50 languages, extracted from documentations of small and often endangered languages. Transcribed with time-aligned annotations. 

Fieldwork Futures Seminar Series

In response to the CASS research fieldwork COVID-19 check-in survey, a series of seminars have been designed to support fieldwork researchers in this difficult time. These offer forums for discussion, ideas for redirecting research or modifying methodologies, and opportunities for networking. Following the speaker presentations there will be open discussion. (The speakers will be recorded, but not the discussion.)

You are cordially invited to these seminars

Same link for each seminar: Zoom link

ReDesigning Research – Exploring methodologies 2

Tues Oct 20, 12.30 – 1.30pm

Speakers

Chair: Dr. Burcu Cevik-Compiegne, Centre for Arab and Islamic Studies, RSSS

Ethics and reframing protocols to be COVID-safe

Tues 3 November, 11.30am – 12.30pm

Speakers

Chair: Assoc. Prof. Deirdre Howard-Wagner, Centre for Aboriginal Economic Policy Research, RSSS

No need to register

Enquiries:
Carmel.O’Shannessy@anu.edu.au