Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Nov 15;14(6):e156.
doi: 10.2196/jmir.2121.

"Right time, right place" health communication on Twitter: value and accuracy of location information

Affiliations

"Right time, right place" health communication on Twitter: value and accuracy of location information

Scott H Burton et al. J Med Internet Res. .

Abstract

Background: Twitter provides various types of location data, including exact Global Positioning System (GPS) coordinates, which could be used for infoveillance and infodemiology (ie, the study and monitoring of online health information), health communication, and interventions. Despite its potential, Twitter location information is not well understood or well documented, limiting its public health utility.

Objective: The objective of this study was to document and describe the various types of location information available in Twitter. The different types of location data that can be ascertained from Twitter users are described. This information is key to informing future research on the availability, usability, and limitations of such location data.

Methods: Location data was gathered directly from Twitter using its application programming interface (API). The maximum tweets allowed by Twitter were gathered (1% of the total tweets) over 2 separate weeks in October and November 2011. The final dataset consisted of 23.8 million tweets from 9.5 million unique users. Frequencies for each of the location options were calculated to determine the prevalence of the various location data options by region of the world, time zone, and state within the United States. Data from the US Census Bureau were also compiled to determine population proportions in each state, and Pearson correlation coefficients were used to compare each state's population with the number of Twitter users who enable the GPS location option.

Results: The GPS location data could be ascertained for 2.02% of tweets and 2.70% of unique users. Using a simple text-matching approach, 17.13% of user profiles in the 4 continental US time zones were able to be used to determine the user's city and state. Agreement between GPS data and data from the text-matching approach was high (87.69%). Furthermore, there was a significant correlation between the number of Twitter users per state and the 2010 US Census state populations (r ≥ 0.97, P < .001).

Conclusions: Health researchers exploring ways to use Twitter data for disease surveillance should be aware that the majority of tweets are not currently associated with an identifiable geographic location. Location can be identified for approximately 4 times the number of tweets using a straightforward text-matching process compared to using the GPS location information available in Twitter. Given the strong correlation between both data gathering methods, future research may consider using more qualitative approaches with higher yields, such as text mining, to acquire information about Twitter users' geographical location.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1
Figure 1
Distribution of Twitter users by time zone (time zones are aligned with longitudes not accounting for deviations based on country borders).
Figure 2
Figure 2
Twitter users providing location indicators in the US time zones.
Figure 3
Figure 3
The proportion of Twitter users identified in each state and the proportion of the 2010 US census population in each state, ordered by census population.
Figure 4
Figure 4
The number of geolocated Twitter users per capita in each state.

Similar articles

Cited by

References

    1. Pentland A, Lazer D, Brewer D, Heibeck T. Using reality mining to improve public health and medicine. Stud Health Technol Inform. 2009;149:93–102. - PubMed
    1. Estrin D, Sim I. Health care delivery. Open mHealth architecture: an engine for health care innovation. Science. 2010 Nov 5;330(6005):759–60. doi: 10.1126/science.1196187.330/6005/759 - DOI - PubMed
    1. Eke PI. Using social media for research and public health surveillance. J Dent Res. 2011 Sep;90(9):1045–6. doi: 10.1177/0022034511415277.0022034511415277 - DOI - PMC - PubMed
    1. Farmer AD, Bruckner Holt CE, Cook MJ, Hearing SD. Social networking sites: a novel portal for communication. Postgrad Med J. 2009 Sep;85(1007):455–9. doi: 10.1136/pgmj.2008.074674.85/1007/455 - DOI - PubMed
    1. Jain SH. Practicing medicine in the age of Facebook. N Engl J Med. 2009 Aug 13;361(7):649–51. doi: 10.1056/NEJMp0901277.361/7/649 - DOI - PubMed

LinkOut - more resources