Job Description
Data Engineer - R&D Precision Medicine
What you will do
Let's do this. Let's change the world. In this vital role, you will be responsible for:
- End-to-end development of an enterprise analytics and data mastering solution using Databricks and Power BI
- Creating scalable, reliable, and impactful enterprise solutions that support research cohort-building and advanced research pipelines
- Building and surfacing large unified repositories of human data based on integrations from multiple repositories and solutions
- Collaborating closely with key customers, product team members, and related IT teams to design and implement data models, integrate data, and ensure best practices for data governance and security
- Designing and building scalable enterprise analytics solutions using Databricks, Power BI, and other modern data tools
- Leveraging data virtualization, ETL, and semantic layers to balance unification, performance, and data transformation while reducing data proliferation
- Breaking down features into work that aligns with the architectural direction and runway
- Participating hands-on in pilots and proofs-of-concept for new patterns
- Creating robust documentation of data analysis and profiling, proposed designs, and data logic
- Developing advanced SQL queries to profile and unify data
- Developing data processing code in SQL, along with semantic views to prepare data for reporting
- Developing Power BI models and reporting packages
- Designing robust data models and processing layers that support both analytical processing and operational reporting needs
- Designing and developing solutions based on best practices for data governance, security, and compliance within Databricks and Power BI environments
- Ensuring the integration of data systems with other enterprise applications to create seamless data flows across platforms
- Developing and maintaining Power BI solutions, optimizing data models and reports for performance and scalability
- Collaborating with key customers to define data requirements, functional specifications, and project goals
- Continuously evaluating and adopting new technologies and methodologies to enhance the architecture and performance of data solutions
What we expect of you
The R&D Data Catalyst Team is responsible for:
- Building Data Searching, Cohort Building, and Knowledge Management tools for scientific visibility into human datasets, projects, study histories, and scientific findings
- Supporting Amgen's goal to accelerate discovery and speed to market for precision medications
Basic Qualifications
- Master's degree: 1 to 3 years of Data Engineering experience
- Bachelor's degree: 3 to 5 years of Data Engineering experience
- Diploma: 7 to 9 years of Data Engineering experience
Must-Have Skills
- Minimum of 3 years of hands-on experience with BI solutions (preferably Power BI or Business Objects), including report development, dashboard creation, and optimization
- Minimum of 3 years of hands-on experience building change data capture (CDC) ETL pipelines, designing and building data warehouses, and managing enterprise-level data
- Hands-on experience with Databricks, including data engineering, optimization, and analytics workloads
- Deep understanding of Power BI, including model design, DAX, and Power Query
- Proven experience designing and implementing data mastering solutions and data governance frameworks
- Expertise in cloud platforms (AWS), data lakes, and data warehouses
- Strong knowledge of ETL processes, data pipelines, and integration technologies
- Good communication and collaboration skills for working with cross-functional teams and senior leadership
- Ability to assess business needs and design solutions that align with organizational goals
- Strong hands-on capabilities with data profiling, data transformation, and data mastering
- Success in mentoring and training team members
Good-to-Have Skills
- Experience in developing differentiated and deliverable solutions
- Experience with human data, ideally human healthcare data
- Familiarity with laboratory testing, patient data from clinical care, HL7, FHIR, and clinical trial data management
Professional Certifications
- ITIL Foundation or other relevant certifications (preferred)
- SAFe Agile Practitioner (6.0)
- Microsoft Certified Data Analyst Associate (Power BI) or related certification
- Databricks Certified Professional or similar certification
Soft Skills
- Excellent analytical and troubleshooting skills
- Deep intellectual curiosity
- High degree of initiative and self-motivation
- Strong verbal and written communication skills, including presenting complex technical/business topics to varied audiences
- Confidence as a technical leader
- Ability to work effectively with global, remote teams
- Ability to handle multiple priorities successfully
- Team-oriented, with a focus on achieving team goals
- Strong problem-solving and analytical skills
- Ability to learn quickly and synthesize complex information from diverse sources
Skills Required
Databricks, Power BI, SQL, ETL, AWS, Data Warehousing, Data Engineer