About Us:
Drug Hunter (drughunter.com) is an essential knowledge platform for biotech/pharma R&D scientists that serves thousands of R&D leaders from hundreds of major pharma and biotech companies around the world.
We are seeking a highly motivated Patent Informatics Data Scientist to assemble, analyze novel chemical and biological datasets, and contribute towards innovative solutions in drug design.
Job Summary:
As Patent Informatics Data Scientist, you will apply a wide variety of approaches to building, curating and integrating drug discovery patent data, with an initial focus on small molecule patents. The job is a unique blend of organization, and curation of patent data, alongside data/software engineering to build high quality data views and content for our platform and users. Deep domain knowledge of the patent process for pharmaceuticals, is required, alongside hands-on experience of public domain/commercial patent data systems in the drug discovery space.
You will work closely with both the internal product team and external partners who range from medicinal chemists, pharmacologist, strategists, and software engineers to enhance our product and support of the community.
Key Responsibilities:
- Develop and apply patent data-gathering and mining approaches to build high quality foundation data sets.
- Process and analyze small- and large-chemical and biological intellectual property datasets, including therapeutic use, molecular target and chemical structure data.
- Integrate chemical and biological databases to patent data.
- Develop novel data mining approaches to unearth cryptic data that enables decision making in partner organizations.
- Optimize data pipelines for processing and storing patent data, using text-mining, cheminformatics and bioinformatics approaches.
- Collaborate with cross-functional teams to integrate computational approaches into curation and analysis workflows.
- Contribute to thought leader articles on drug intellectual property informatics and data mining.
- Maintain best practices in data integrity, reproducibility, and documentation of data sources and derived content.
- Other responsibilities as required to support product development and maintenance.
Requirements
Required Qualifications:
- Ph.D. or Master’s degree in Cheminformatics, Computational Chemistry, Bioinformatics, Data Science, or a related field.
- 5-10+ years of experience in cheminformatics, computational drug discovery, or machine learning applications in chemistry.
- Proficiency in Python/R, with experience in cheminformatics libraries and topics.
- Strong knowledge of molecular descriptors, drug targets, and chemical/biological informatics techniques.
- An innate sense of how to query and derive value from patent data.
- Familiarity with Open Source and academic/commercial competitive intelligence/patent systems.
- Experience working in a structured collaborative data and software development environment (git, SQL/Postgres, python notebooks).
- Exceptional communication skills in written and verbal communication of science, a natural story-teller to make sense and provide insights from complex data.
Preferred Qualifications:
- Understanding of regulatory and patent landscapes for chemical and pharmaceutical data.
- Text mining experience, NER/NLP. Existing expertise in Python and relational database systems. API development and systems architecture.
- We understand that we are looking for a broad range of skills, so are committed to on the job coaching from experienced team members.
Benefits
What We Offer:
- Competitive salary and stock options.
- Active mentoring in data science/drug discovery within a highly experienced team
- Professional development opportunities and conference sponsorship.
- A collaborative environment working on cutting-edge computational drug discovery and data science.