Principal Data Engineer – Data & Analytics (Global Supply Chain)
Full-timeSeniorEngineeringOphthalmology
Market Rate — Chemical Engineers
25th
$92K
Median
$112K
75th
$139K
BLS 2024 data (national)
Description
<h2><b>Career Category</b></h2>Information Systems<h2></h2><h2><b>Job Description</b></h2><div><div><p><u><span>Role Description:</span></u><span> </span></p></div><div><p><span><span>This role </span><span>acts as </span></span><span><span>t</span><span>echnical</span><span> architect and hands-on lead</span><span> </span><span>for Data Engineering</span><span> practices</span><span> across the Smart Supply Chain initiative within Amgen.</span><span> </span></span><span><span>Additionally, responsible for designing, building, </span><span>maintaining</span><span>, analyzing, and interpreting data to </span><span>provide</span><span> actionable insights that drive business decisions.</span></span><span> </span></p></div><div><p><span><span>This role involves working with large datasets, developing reports, </span><span>supporting</span><span> and executing data governance </span><span>initiatives</span><span> </span><span>and</span><span>,</span><span> visualizing data </span><span>to </span><span>ensure data is accessible, reliable, and efficiently managed. The ideal candidate has strong technical skills, experience with big data technologies, and a deep understanding of data architecture and ETL processes</span><span> and </span></span><span><span>will architect, build, and </span><span>optimize</span><span> enterprise-grade data pipelines using Databricks and AWS-native services.</span></span><span> </span></p></div><div><p><u><span>Roles & Responsibilities:</span></u><span><span> </span></span><span> </span></p></div><div><ul><li><p><span><span>Design, develop, and </span><span>maintain</span><span> data solutions for data generation, collection, and processing </span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Be a key team member </span><span>that </span><span>assists</span><span> </span><span>in design and development of the data pipeline</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Create data pipelines and ensure data quality by implementing ETL processes to migrate and deploy data across systems</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Contribute to the design, development, and implementation of data pipelines, ETL/ELT processes, and data integration solutions</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Take ownership of data pipeline projects from </span><span>inception</span><span> to deployment, manag</span><span>e </span><span>scope, timelines, and risks</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Collaborate with cross-functional teams to understand data requirements and design solutions that meet business needs</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Develop and </span><span>maintain</span><span> data models, data dictionaries, and other documentation to ensure data accuracy and consistency</span></span><span> </span></p></li></ul></div></div><div><div><ul><li><p><span><span>Implement data security and privacy measures to protect sensitive data</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Leverage cloud platforms (AWS</span><span>, Databricks</span><span> preferred) to build scalable and efficient data solutions</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Collaborate and communicate effectively with product teams</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Collaborate with Data Architects, Business </span><span>SMEs</span><span>, and Data Scientists to design and develop end-to-end data pipeline</span><span>s</span><span> to meet fast paced business need</span><span>s</span><span> across geographic regions</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Identify</span><span> and resolve complex data-related challenges</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Adhere to best practices for coding, testing</span><span>,</span><span> and designing reusable code/component</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>E</span><span>xplore new tools</span><span> and</span><span> technologies that will help to improve ETL platform performance</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Participate in sprint planning meetings and provide estimations on technical implementation</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Continuously </span><span>monitor</span><span> data governance activities and report on compliance, data quality issues, and the effectiveness of governance initiatives</span></span><span> </span></p></li></ul></div><div><p><u><span>Basic Qualifications and Experience:</span></u><span> </span></p></div><div><ul><li><p><span style="font-size:14px">12 - 17 years of experience in Computer Science, IT or related field </span><span style="font-size:14px"> </span></p></li></ul></div><div><p><u><span>Functional Skills:</span></u><span> </span></p></div><div><p><b><span>Must-Have Skills </span></b><span> </span></p></div><div><ul><li><p><span><span>Hands on experience with</span><span> big data technologies and platforms</span><span>, such as Databricks, Apache Spark (</span></span><span><span>Databricks (</span><span>PySpark</span><span>, </span><span>SparkSQL</span><span>, Delta Lake) and AWS</span></span><b><span> </span></b><span><span>services (S3, EMR, Lambda, Glue, UC, Athena, Redshift, EKS), </span></span><span><span>w</span><span>orkflow </span><span>orchestration, performance tuning on big data processing</span><span> </span></span><span><span>and the ability to work with large, complex datasets</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Hands-on experience in orchestrating large-scale data pipelines, performance tuning, lineage tracking, and observability frameworks.</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Proficiency</span><span> in data analysis tools (</span><span>eg.</span><span> </span><span>SQL</span><span>, Python</span><span>) and experience with data visualization </span><span>tools (</span><span>Tableau, Power BI)</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Excellent problem-solving skills </span></span><span><span>Experience with DevOps practices, version control (Git), CI/CD (Jenkins), and Infrastructure as Code.</span></span><span> </span></p></li></ul></div></div><div><div><p><b><span>Good-to-Have Skills:</span></b><span> </span></p></div><div><ul><li><p><span><span>Experience with ETL tools such as Apache </span><span>Spark, and various Python packages related to data processing, machine learning model development</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Strong understanding of data modeling, data warehousing, and data integration concepts</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Working knowledge of unstructured data processing, vector stores, and AI-enablement for downstream analytics.</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Strong understanding of </span><span>SAP data models</span><span> </span><span>(ECC tables) and Supply Chain data domains.</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Experience working in Agile/</span><span>SAFe</span><span> environments with distributed global teams.</span></span><span> </span></p></li></ul></div><div><p><u><span>Professional Certifications</span><span>: </span></u><span> </span></p></div><div><ul><li><p><span><span>Certified Data Engineer </span><span>(preferred on </span><span>Databricks </span><span>or cloud environments)</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Machine Learning Certification (preferred</span><span>)</span></span><span> </span></p></li></ul></div><div><p><u><span>Soft Skills:</span></u><span> </span></p></div><div><ul><li><p><span><span>Excellent </span><span>critical-thinking and </span><span>problem-solving skills </span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Strong communication</span><span> and collaboration skills</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Demonstrated awareness of how to function in a team setting</span></span><span> </span></p></li></ul></div><div><ul><li><p><span><span>Demonstrated presentation skills</span></span><u><span> </span></u><span> </span></p></li></ul></div><div><p><span> </span></p></div></div><p style="text-align:inherit"></p><p style="text-align:inherit"></p><p style="text-align:inherit"></p>.
Amgen
BIOTECHNOLOGY
Small Molecules, Biologics
LocationTHOUSAND OAKS, CA
Employees27,000
Open Jobs1215
OncologyCardiovascularBone HealthImmunologyNeuroscience
View Company ProfilePipeline
Physician SurveyN/A
Peds Metabolic Syndrome in PsoriasisN/A
Persistence With Prolia® (Denosumab) in Postmenopausal Women With OsteoporosisN/A
TAP® Micro Select DeviceN/A
ENBREL®N/A