Lead Data Engineer (genai / Llm Applications) @ Clario

Home > Data Science & Machine Learning

Posted 40 days ago — confirm the vacancy is still active.

Lead Data Engineer (genai / Llm Applications)

Clario
5 - 9 years
Bengaluru
1 month ago
Email to a friend
Report this job

Job Description

We are looking for a skilled and motivated Lead Engineer to join our Data Science and Delivery group at Clario, a part of Thermo Fisher Scientific. This role combines software development, data engineering, and analytical problemsolving to design, build, and maintain scalable data platforms that support clinical trial operations and business intelligence. You will work across the full software development lifecycle (SDLC)from requirements gathering through production supportcollaborating closely with data scientists, analysts, product managers, and engineering teams to deliver highquality, datadriven solutions.

What We Offer

Competitive compensation aligned with local market practices
Comprehensive health and wellness benefits
Paid time off and company holidays
Opportunities for professional development, learning, and career growth
The flexibility of working from Bangalore or remotely within India, while collaborating with global teams

What Youll Be Doing

Design, develop, and maintain scalable software architectures and data pipelines that integrate with analytical and operational systems.
Write clean, reusable, and welltested Python code using frameworks such as Flask and related libraries.
Leverage AIassisted development tools, including GitHub Copilot and LangChain, to design, build, and integrate LLMpowered solutions such as retrievalaugmented generation (RAG) pipelines, intelligent agents, and automated workflows using AWS Bedrock or similar services.
Develop and optimize complex SQL across Oracle, MS SQL Server, PostgreSQL, and Snowflake, including procedures, functions, views, analytical functions, and dynamic SQL.
Design and implement ETL pipelines using Snowflake and related data processing technologies.
Implement scheduling and orchestration using Apache Airflow or similar workflow orchestration frameworks.
Establish and maintain data quality frameworks, versioning, and governance practices to ensure data reliability, integrity, and compliance.
Develop and maintain data architectures and models for both structured and unstructured data sources.
Troubleshoot production issues and drive continuous improvement in software quality, performance, and reliability.
Deploy, manage, and support solutions on AWS, including storage, compute, and pipeline services.
Create sourcetotarget mappings and support data and code migration initiatives.
Partner with stakeholders to gather requirements, translate business needs into technical solutions, and produce clear, wellstructured documentation.
Collaborate with product managers, analysts, and crossfunctional teams to deliver datadriven insights and reporting using tools such as Plotly and Power BI.

What We Look For

Bachelors or higher degree in Computer Science, Information Technology, or a related technical field.
5+ years of professional experience in software engineering, data engineering, or datafocused development roles.
Strong proficiency in Python, including frameworks and libraries such as Django or Flask, pandas, NumPy, Plotly, and agGrid.
Strong SQL expertise with Oracle, MS SQL Server, PostgreSQL, and/or Snowflake.
Proven experience writing complex SQL, including analytical and window functions, subqueries, all join types, DML/DDL/TCL statements, CASE expressions, and performance tuning.
Working knowledge of cloud platforms, with a preference for AWS (S3, EC2, Secrets Manager, Bedrock, Lambda).
Experience using AIassisted development tools and frameworks such as GitHub Copilot and LangChain for building LLMpowered applications and workflows.
Experience with Gitbased version control systems and CI/CD pipelines.
Familiarity with data modeling concepts for both structured and unstructured data.
Strong analytical thinking, problemsolving abilities, and communication skills.
Willingness to work across all phases of the SDLC, including requirements gathering, design, development, deployment, and production support.
Preferred experience includes exposure to the clinical trial lifecycle or clinical data management, data visualization tools (Plotly, Power BI), frontend technologies (HTML5, CSS3, JavaScript), collaboration tools (Jira, Confluence, Microsoft Teams), and handson data analysis or data cleansing using programming languages, SQL, and Excel.

At Clario, our purpose is to transform lives by unlocking better evidence. Its a cause that unites and inspires us. Its why we come to workand how we empower our people to make a positive impact every day. Whether youre starting your clinical data career or building longterm expertise, your work helps bring lifechanging therapies to patients faster.

Job Classification

Industry: Pharmaceutical & Life Sciences
Functional Area / Department: Data Science & Analytics
Role Category: Data Science & Machine Learning
Role: Data Engineer
Employement Type: Full time

Contact Details:

Company: Clario
Location(s): Bengaluru

+ View Contact

Login

Candidates can login here to view contacts and apply.

Sign In Sign Up

Email:

Password:

Password too short

To create your profile, apply for a job or make a registration

Your name (*)

Email (*)

Mobile (*)

Preferred City (* max. 2 w/comma)

Designation / Expected Role

Current / Recent Company (*)

Experience (*)

Expected Salary (*)

Desired Industry (*):

Functional area / Department (*):

Enter Skills (key skills, subjects, technologies & roles to use in search)

Write briefly about yourself, your experience and education (*)

Attach Resume Max 2.38 MB (RTF, PDF, DOC, DOCX formats only parsed)

Please, check the file size and type.

Add social media [ + ]

Create password

I agree with website service terms and conditions

Candidates are expected to provide most recent and accurate profile information, inappropriate content is strictly prohibited!

Keyskills: Etl Pipelines Complex Sql Query Power Bi AWS Python Langchain Snowflake Github Copilot Ci/Cd SQL

Fraud Alert to job seekers!

₹ Not Disclosed

Job application

We will notify the employer with your details. You can also attach a resume or a cover letter.

Sign In Sign Up

Email:

Password:

Password too short

To create your profile, apply for a job or make a registration

Your name (*)

Email (*)

Mobile (*)

Preferred City (* max. 2 w/comma)

Designation / Expected Role

Current / Recent Company (*)

Experience (*)

Expected Salary (*)

Desired Industry (*):

Functional area / Department (*):

Enter Skills (key skills, subjects, technologies & roles to use in search)

Write briefly about yourself, your experience and education (*)

Attach ResumeMax 2.38 MB (RTF, PDF, DOC, DOCX formats only parsed)

Please, check the file size and type.

Add social media [ + ]

Create password

I agree with website service terms and conditions

Clario

At Clario, we put people first, always. We are united and driven by patients, committed to making a difference, and we are always looking for the best talent to help us transform lives. We value the contribution each of our people brings. Its only through our people that we can continue to innova...

Lead Data Engineer... in Bengaluru

CSE Propulsion Engineers... Capgemini

Memory Layout Engineer Capgemini

Senior Software Engineer -... IBM

Splunk Monitoring Engineer... Trigent Software

Senior Product Engineers Cognizant

Senior Software Engineer -... IBM
See all →

Lead Data Engineer (genai / Llm Applications) @ Clario

Home > Data Science & Machine Learning

Lead Data Engineer (genai / Llm Applications)

Job Description

Job Classification

Contact Details:

Create password

Create password

Clario

Job Listings

Job type