Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2022 Jan 3;3(3):100423.
doi: 10.1016/j.patter.2021.100423. eCollection 2022 Mar 11.

A guide to backward paper writing for the data sciences

Affiliations
Review

A guide to backward paper writing for the data sciences

Jon Zelner et al. Patterns (N Y). .

Abstract

In this perspective, we outline a set of best practices for the planning, writing, and revision of scientific papers and other forms of professional communication in the data sciences. We propose a backward approach that begins with clearly identifying the scientific and professional goals motivating the work, followed by a purposeful mapping from those goals to each section of a paper. This approach is motivated by the conviction that manuscript writing can be more effective, efficient, creative, and even enjoyable-particularly for early-career researchers-when the overarching goals of the paper and its individual components are clearly mapped out.

PubMed Disclaimer

Conflict of interest statement

The authors declare no competing interests.

Figures

Figure 1
Figure 1
Schematic representation of the process of backward paper writing The high-level steps involved in the process of backward data science manuscript preparation. The square boxes at the top represent the important pre-writing steps in which you clarify the scientific and professional goals motivating your work. The rounded box represents the process of initial writing and revision. Once a draft is complete, the diamond box represents circulating the manuscript to colleagues and mentors for feedback, or submitting for publication, with the expectation that this will result in further revision and updating of your work. The circle represents the typical endpoint of the process: publication in a peer-reviewed outlet, sharing publicly via a preprint server, publishing online via an interactive notebook or app, or the many other ways in which data science research can be disseminated to relevant scientific communities and the public at large. Finally, the dashed arrow represents the potential for post-publication revision in response to feedback and critique or new data. While not required, this type of post-publication revision is increasingly common in data science fields, allows for greater transparency, and may increase the long-term relevance of the published work.

References

    1. Nolan D., Stoudt S. 1st edition. Oxford University Press; 2021. Communicating with Data: The Art of Writing for Data Science.
    1. Curry S. Let’s move beyond the rhetoric: it’s time to change how we judge research. Nature. 2018;554:147. doi: 10.1038/d41586-018-01642-w. - DOI - PubMed
    1. Stern B.M., O’Shea E.K. A proposal for the future of scientific publishing in the life sciences. PLoS Biol. 2019;17:e3000116. doi: 10.1371/journal.pbio.3000116. - DOI - PMC - PubMed
    1. van der Aalst W. In: Process Mining: Data Science in Action. van der Aalst W., editor. Springer; 2016. Data science in action; pp. 3–23. - DOI
    1. Schulte B.A. Scientific writing & the scientific method: parallel “Hourglass” structure in form & content. Am. Biol. Teach. 2003;65:591–594. doi: 10.2307/4451568. - DOI

LinkOut - more resources