Data Scientist, RNA Biology
Data ScienceSmall Molecule
$135K/yr(estimated)
Description
<p><span style="font-family: helvetica, arial, sans-serif;">At Atomic AI, we build artificial intelligence to pioneer new frontiers in drug discovery. Our unique R&amp;D platform, an early version of which was featured on the <a href="https://www.science.org/doi/abs/10.1126/science.abe5650">cover of Science</a>, provides new strategies to treat previously undruggable diseases by targeting RNA. We continue to advance this platform by developing new machine learning methods and&nbsp;<a href="https://www.biorxiv.org/content/10.1101/2023.12.13.571579v1">unique foundation models</a> fueled by our large-scale, in-house experimental data collection. We are an interdisciplinary team of scientists and engineers and believe our people are our greatest strength and the key to our success.<br><br></span></p>
<p><span style="font-family: helvetica, arial, sans-serif;"><strong>The opportunity</strong></span></p>
<p><span style="font-family: helvetica, arial, sans-serif;">As a full-time Data Scientist on the Machine Learning team, you will work closely with scientists and engineers to apply and advance our technology platform for RNA structure prediction, target identification, and early-stage drug discovery. You will lead the curation of RNA-focused datasets for ML training and validation, discover statistical patterns in our large-scale datasets evaluating RNA structure and RNA-small molecule interactions, and devise and implement new strategies to test the accuracy of our ML models. Your analysis will guide the development of improved ML models and the targeted acquisition of new experimental data.</span></p>
<p><span style="font-family: helvetica, arial, sans-serif;">This is a hybrid position with three days in-person at our South San Francisco office.<br><br></span></p>
<p><span style="font-family: helvetica, arial, sans-serif;"><strong>Responsibilities:</strong></span></p>
<ul>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Provide RNA biology and RNA structure expertise on the ML team.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Enable and apply our RNA-structure platform to prioritize RNA targets for small-molecule therapies and advance structure-based drug discovery.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Generate insights from datasets on RNA structures and small molecule interactions (e.g. chemical probing, RNA-SM screens) by conducting statistical analyses, interpreting biological noise, and applying RNA domain expertise.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Curate RNA datasets for training of ML models, help evaluate model performance, and provide directions for improvement.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Inform scientific questions and ML model development in early-stage RNA drug discovery.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Collaborate with the internal wetlab team and shape the design of experimental assays on RNA structure and RNA-SM interactions.<br><br></span></li>
</ul>
<p><span style="font-family: helvetica, arial, sans-serif;"><strong>About you:</strong></span></p>
<ul>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Ph.D. in Computational Biology, Bioinformatics, Statistics, Biophysics, or related field, or equivalent experience.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Expertise in RNA biology and biochemistry, RNA-protein interactions, and RNA structure.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Proficiency in Python for data curation and analysis at scale, and fluency with libraries for data analysis (NumPy, pandas) and applied ML (scikit-learn).</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Strong programming background, familiarity with Unix and comfort with using external software packages.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Strong foundation in statistics and experience with conducting statistical analysis of large-scale datasets.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Excellent presentation and writing skills, able to clearly communicate technical information to colleagues.<br><br></span></li>
</ul>
<p><span style="font-family: helvetica, arial, sans-serif;"><strong>Pluses:</strong></span></p>
<ul>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">History of scientific achievement, e.g. as evidenced by publication of impactful papers.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Conceptual understanding of ML model development and evaluation, and experience using ML models.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Experience with computational structural biology tools for modeling RNA secondary and tertiary structure (e.g. Rosetta, AlphaFold, RNAfold).</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Understanding of RNA-SM interactions, including familiarity with structural properties of binding sites and experimental methods for evaluating binding.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Proficiency with pipelines for next-generation sequencing dataset processing.</span></li>
<li style="font-family: helvetica, arial, sans-serif;"><span style="font-family: helvetica, arial, sans-serif;">Exposure to high throughput experimental assays for evaluating RNA structure and screening RNA-SM interactions.<br><br></span></li>
</ul>
<p><span style="font-family: helvetica, arial, sans-serif;">Salary Range: $135,000/year to $180,000/year + equity + benefits. This range reflects variations in seniority, expertise, and skills.&nbsp;</span></p>
<p>&nbsp;</p><div class="content-conclusion"><p>Atomic AI is committed to equal employment opportunity&nbsp;regardless of race, color, ancestry, national origin, religion, sex, age, sexual orientation, gender identity and expression, marital status, disability, or veteran status.</p></div>
Atomic AI
BIOTECHNOLOGY
AI-based RNA Drug Discovery
LocationCA - South SF
Open Jobs2
Gene Therapy
View Company Profile