Data Scientist (R&D Project)

  1. Home
  2. Remote jobs
  3. A/B Testing
  • Company Public Library of Science
  • Employment Full-time
  • Location 🇺🇸 United States nationwide
  • Submitted Posted 1 week ago - Updated 1 day ago
<h2><strong><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><sup>The position is anticipated to be short-term, and not expected to exceed December 31, 2025. Please note that there is no guaranteed duration of the position, and your employment will be at will and for no fixed duration. As such, it can be terminated by you or the organization at any time, with or without notice or cause, for any reason not otherwise prohibited by law.</sup></span></strong></h2><h2><strong><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">*This position is fully remote/home based. Applications will be accepted from candidates based in the UK and the following US states: FL, MA, MD, NY, PA, TX, VA.</span></strong></h2><p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><strong><em>PLOS is a nonprofit organization on a mission to drive open science forward with measurable, meaningful change in research publishing, policy, and practice. We believe in a better future where science is open to all, for all</em></strong></span></p><h2><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Role Summary</span></h2><p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Use data science to provide insight into the nature and structure of our data and content, both published content and internal data sets, and lead on developing models to improve processing, access, understanding and use of that data.</span></p><p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Working closely with the Subject Matter Experts, Product Managers, Software Engineers and Product Designers, you will play a key role in improving understanding of our content and data, improving how we manage, process and use that data in support of PLOS’s goals.</span></p><p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">You will be tasked with the large-scale analysis of our broad and varied collection of scholarly content, which includes research articles and associated data sets, and line of business data and information. This will require working with structured and unstructured data, a large corpus of scholarly articles, using programmatic techniques such as statistical analysis, natural language processing, information retrieval, and machine learning. You will also work with the rest of the team to turn your insights and software prototypes into production services that improve the utility of this data for both our end users and internal stakeholders.</span></p><h3><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Responsibilities</span></h3><ul><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Create and use machine learning models, statistical analysis, natural language processing to improve scientific content workflows, enhance discoverability, and support Open Science initiatives.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Collect, clean, and analyze large datasets of scientific content and related information from various sources, ensuring data quality and integrity.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Build and test predictive models and machine learning algorithms for tasks such as entity extraction, workflow automation, and enhancing the understanding of scientific content.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Visualize and present findings in a clear, concise, and compelling manner to both technical and non-technical audiences.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Work as part of a cross-functional team, contributing insights, models and code and deploying production services that improve our use of data.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Collaborate with editorial, marketing, product, and colleagues across PLOS to understand data needs and translate business requirements into analytical solutions that enable new open science capabilities.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Contribute to the development of data strategies and best practices within the organization and identify opportunities for workflow optimization and automation.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Engage with the latest research and trends in data science, Open Science, and scholarly publishing, proactively identifying opportunities to apply innovative techniques and refine best practices.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Consider the ethical implications of all data techniques as applied to our data, always ensuring that they are appropriate, take into account the potential for negative impact and do not bias research.</span></li></ul><h3><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Knowledge and Skills</span></h3><ul><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Extensive experience in statistical modeling, machine learning, and data mining techniques, with a focus on applications in text analysis or scientific data, including knowledge of forecasting, A/B testing, entity extraction, and feature engineering.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Proficiency in programming languages such as Python, R, and SQL, and data analysis libraries (e.g., Pandas, NumPy, SciPy, Tidyverse).</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Strong knowledge of machine learning frameworks and libraries (e.g., TensorFlow, PyTorch, scikit-learn, NLTK).</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Experience with NLP techniques, such as named entity recognition (NER), topic modeling, semantic similarity, and knowledge graph construction.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Demonstrated ability to communicate complex technical findings clearly and effectively, both verbally and in writing, through reports and presentations to diverse audiences.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Strong analytical and problem-solving skills, with a high degree of attention to detail and accuracy in handling scientific data.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Experience working with large datasets and database systems, and ideally with scientific content repositories or publishing platforms.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Familiarity with the scientific research environment, scholarly literature, and open science principles are an advantage.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Able to develop hypotheses based on quantitative and qualitative evidence</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Experience working with solid development practices, git, CI etc.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Ability to work effectively both independently and collaboratively within a remote, agile team environment.</span></li></ul><h3><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Qualifications</span></h3><ul><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">A Master's degree in a relevant field such as Data Science, Statistics, Computer Science, Bioinformatics, or a related quantitative discipline with a focus on scientific applications is preferred.</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Relevant work experience in a data science role within scientific publishing, research, or a related field is desirable.</span></li></ul><h3><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Physical Requirements and Work Environment</span></h3><ul><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Prolonged periods stationary at a desk and working on a computer</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Some national and international travel will be required</span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">Some flexibility to work across time zones</span></li></ul><p>&nbsp;</p><p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;">The base salary range we’ve established for this position is <strong>(US) $105,000 - $145,000.</strong>&nbsp;<span data-contrast="auto">PLOS also offers a comprehensive benefits package summarized below.</span><span data-ccp-props="{}">&nbsp;</span></span></p><p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><strong><span data-contrast="auto">BENEFITS:</span></strong><span data-ccp-props="{}">&nbsp;</span></span></p><p><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><strong><span data-contrast="auto">US:</span></strong><span data-ccp-props="{}">&nbsp;</span></span></p><ul><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span data-contrast="auto">401k with employer match</span><span data-ccp-props="{}">&nbsp;</span></span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span data-contrast="auto">Employee sponsored health, dental and vision insurance (Dental and Vision 100% employer paid)</span><span data-ccp-props="{}">&nbsp;</span></span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span data-contrast="auto">Paid Vacation, 12 public holidays and sick leave</span><span data-ccp-props="{}">&nbsp;</span></span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span data-contrast="auto">Parental leave</span><span data-ccp-props="{}">&nbsp;</span></span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span data-contrast="auto">Birthday and three winter holidays days off</span><span data-ccp-props="{}">&nbsp;</span></span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span data-contrast="auto">Short term and long term disability insurance</span><span data-ccp-props="{}">&nbsp;</span></span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span data-contrast="auto">2 days paid time off for volunteering per year</span><span data-ccp-props="{}">&nbsp;</span></span></li><li style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span style="font-family: arial, helvetica, sans-serif; font-size: 10pt;"><span data-contrast="auto">Fully remote work environment with stipend on joining for home office </span><span data-ccp-props="{}">&nbsp;</span></span></li></ul><p>&nbsp;</p><div class="content-conclusion"><p>To learn more about how PLOS protects your privacy, see our <a href="https://plos.org/employee-privacy-notice/" target="_blank">Employee Privacy Notice</a>.</p></div>

Loading similar jobs...

USA Remote Jobs

Discover fully remote job opportunities in the United States at USA Remote Jobs. Apply for roles like Software Developer, Customer Service Specialist, Project Manager, and more!

© 2025 Created by USA Remote Jobs. All rights reserved.