Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Apr 15;9(5):107442.
doi: 10.1016/j.cdnut.2025.107442. eCollection 2025 May.

Population Recruitment Strategies in the Age of Bots: Insights from the What Is on Your Plate Study

Affiliations

Population Recruitment Strategies in the Age of Bots: Insights from the What Is on Your Plate Study

Emily G Elenio et al. Curr Dev Nutr. .

Abstract

Background: To evaluate state-wide nutrition policies, valid tools are required to gather sufficient sample sizes. Remote data collection, including web-based dietary assessments, offers convenience for participants and researchers and enables faster and more diverse recruitment. However, it presents challenges, including risk of bots compromising data integrity.

Objectives: This study describes the technical survey design of an ongoing longitudinal study, which is evaluating a state-wide Supplemental Nutrition Assistance Program (SNAP) incentive program, discusses strategies to prevent and identify bots, duplicates, fraudulent entries, and implausible data, and provides recommendations to improve future public health nutrition research.

Methods: From May to September 2023, SNAP participants from Rhode Island and Connecticut were recruited to complete an online food frequency questionnaire (FFQ) and a demographic survey. Given the large sample and online format, our interdisciplinary team designed the technical backend to optimize participants' convenience while ensuring data quality through an automated system that assessed FFQ responses. To prevent bots and duplicates, we created duplicate application programming interfaces (API), randomly called participants, and evaluated Completely Automated Public Turing Test to Tell Computers and Humans Apart (reCAPTCHA), geotags, and Internet Protocol (IP) addresses.

Results: Using a combination of text blasts and in-person recruitment, we enrolled 1367 participants, with text blasts proving the most effective strategy (∼60% of participants). Midway through recruitment, we identified 544 potential bots that completed the screener, with duplicate IP addresses and geotags from outside the recruitment area serving as strong indicators of bot activity. At baseline, 112 participants failed FFQ data quality checks, prompting follow-up by research assistants. Our automated duplicate and FFQ APIs saved countless hours of staff time.

Conclusions: Remote data collection tools were critical for meeting recruitment goals and ensuring our data authenticity. A combination of strategies is necessary to effectively mitigate against bots and ensure plausible responses. Widely available, built-in tools (e.g., reCAPTCHA) are helpful but are insufficient alone. Customized solutions like our automated systems may be critical for future researchers to maintain data integrity.

Keywords: Supplemental Nutrition Assistance Program; bots; data integrity; dietary assessment; fraud prevention; online survey security; web-based food frequency questionnaire.

PubMed Disclaimer

Conflict of interest statement

The authors do not have any conflicts of interest to disclose.

Figures

FIGURE 1
FIGURE 1
Baseline survey structure. 1Duplicate flag as detected by Qualtrics or duplicate email or phone detected by the API. 2Flag triggered if age provided in screener did not match date of birth in full survey and/or if participant completed food frequency questionnaire in <10 min, or if participant reported < 600 kcals or > 10,000 kcal. APIs, application programming interfaces; AWS, Amazon Web Services; DOB, date of birth; FFQ, food frequency questionnaire; QA, quality assurance; QR, quick response.
FIGURE 2
FIGURE 2
Follow-up survey structure. 1Receipt flag if receipt does not show EBT card was used, receipt older than 30 days from submission date, or receipt is a duplicate (duplicate photo or EBT card number). 2QA flag triggered if age provided in screener did not match date of birth in full survey and/or if participant completed food frequency questionnaire in < 10 min, or if participant reported < 600 kcals or > 10,000 kcal. APIs, application programming interfaces; EBT, electronic benefit transfer; FFQ, food frequency questionnaire; QA, quality assurance.
FIGURE 3
FIGURE 3
Quality assurance data check process and structure. APIs, application programming interfaces; FFQ, food frequency questionnaire; QA, quality assurance.

Similar articles

Cited by

References

    1. Thornton L., Batterham P.J., Fassnacht D.B., Kay-Lambkin F., Calear A.L., Hunt S. Recruiting for health, medical or psychosocial research using Facebook: systematic review. Internet Interv. 2016;4:72–81. - PMC - PubMed
    1. Arevalo M., Brownstein N.C., Whiting J., Meade C.D., Gwede C.K., Vadaparampil S.T., et al. Strategies and lessons learned during cleaning of data from research panel participants: cross-sectional web-based health behavior survey study. JMIR Form Res. 2022;6(6) - PMC - PubMed
    1. Hensen B., Mackworth-Young C.R.S., Simwinga M., Abdelmagid N., Banda J., Mavodza C., et al. Remote data collection for public health research in a COVID-19 era: ethical implications, challenges and opportunities. Health Policy Plan. 2021;36(3):360–368. - PMC - PubMed
    1. Leonard A., Hutchesson M., Patterson A., Chalmers K., Collins C. Recruitment and retention of young women into nutrition research studies: practical considerations. Trials. 2014;15(1):23. - PMC - PubMed
    1. Evans J.R., Mathur A. The value of online surveys: a look back and a look ahead. Internet Res. 2018;28(4):854–887.

LinkOut - more resources