Data Scientist @ Coforge

Job Description

Job Description: Data Scientist (LLM / NLP / ML & Anomaly Detection)

About the Role

We are seeking a highly skilled and innovative Data Scientist to join our AI & Data Science team. In this role, you will design, build, and deploy advanced machine learning solutions with a strong emphasis on Natural Language Processing (NLP), Large Language Models (LLMs), and Anomaly Detection. You will work on real-world, high-impact problems, from detecting fraudulent transactions and system anomalies to building intelligent document processing pipelines, and you will collaborate closely with engineering, product, and business teams.

Key Responsibilities

LLM & Generative AI

  • Design, build, and deploy LLM-powered applications using frameworks and APIs such as LangChain, LlamaIndex, or the OpenAI API.
  • Develop and optimize prompt engineering strategies (few-shot, chain-of-thought, RAG) to improve the accuracy, consistency, and reliability of LLM outputs.
  • Implement Retrieval-Augmented Generation (RAG) pipelines using vector databases (e.g., FAISS, Pinecone, Chroma, Weaviate).
  • Fine-tune pre-trained LLMs (e.g., GPT, LLaMA, Mistral, Falcon, Claude, Gemini) on domain-specific datasets.
  • Validate and structure LLM outputs using Pydantic models and output parsers to ensure data integrity.

Natural Language Processing (NLP)

  • Build end-to-end NLP pipelines for real-world tasks including:
    • Named Entity Recognition (NER)
    • Text Classification & Sentiment Analysis
    • Information & Data Extraction from Documents
    • Document Summarization & Question Answering
    • Semantic Search & Document Similarity
  • Work with the Hugging Face Transformers ecosystem to leverage and fine-tune pre-trained models (BERT, RoBERTa, T5, etc.).
  • Process large-scale unstructured text data from various sources such as PDFs, emails, scanned documents (OCR), and web content.

Anomaly Detection

  • Design and implement anomaly detection systems for various domains, including:
    • Financial fraud detection (unusual transactions, payment anomalies).
    • Operational anomalies (system logs, network traffic, sensor data).
    • Text-based anomalies (unusual document patterns, suspicious NLP signals).
  • Apply a wide range of anomaly detection techniques including:
    • Statistical Methods: Z-score, IQR, CUSUM.
    • ML-based Methods: Isolation Forest, One-Class SVM, Local Outlier Factor (LOF).
    • Deep Learning Methods: Autoencoders, LSTM-based sequence anomaly detection, Variational Autoencoders (VAEs).
    • Time-Series Methods: ARIMA, Prophet, Seasonal Decomposition.
  • Build real-time and batch anomaly detection pipelines that can scale to large datasets.
  • Define and tune detection thresholds and alert mechanisms in collaboration with business and operations teams.
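
As an illustration of the statistical methods and threshold tuning listed above, here is a minimal sketch of Z-score and IQR outlier detection using only the Python standard library. The function names, default thresholds, and sample transaction amounts are assumptions made for this example; they are not part of the role's actual codebase or data.

```python
import statistics


def zscore_outliers(values, threshold=3.0):
    """Return indices of values whose absolute z-score exceeds `threshold`."""
    mean = statistics.fmean(values)
    stdev = statistics.pstdev(values)  # population standard deviation
    if stdev == 0:
        return []  # all values identical, nothing to flag
    return [i for i, v in enumerate(values) if abs(v - mean) / stdev > threshold]


def iqr_outliers(values, k=1.5):
    """Return indices of values outside [Q1 - k*IQR, Q3 + k*IQR]."""
    q1, _, q3 = statistics.quantiles(values, n=4, method="inclusive")
    iqr = q3 - q1
    low, high = q1 - k * iqr, q3 + k * iqr
    return [i for i, v in enumerate(values) if v < low or v > high]


# Hypothetical transaction amounts with one unusually large payment.
amounts = [20, 22, 19, 21, 23, 20, 22, 21, 500]
print("z-score flags:", zscore_outliers(amounts, threshold=2.5))
print("IQR flags:", iqr_outliers(amounts))
```

With only nine points, a single extreme value inflates the standard deviation, so the example passes a z-score threshold lower than the common default of 3. The IQR rule is less sensitive to this effect, which is one reason thresholds are usually tuned per dataset.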

Machine Learning (ML)

  • Design, train, evaluate, and deploy supervised and unsupervised machine learning models.
  • Perform feature engineering, model selection, hyperparameter tuning, and cross-validation.
  • Build and maintain end-to-end ML pipelines from data ingestion to model serving.
  • Monitor model performance in production and implement retraining strategies to address data drift and model decay.
  • Communicate model results, performance metrics, and business impact to technical and non-technical stakeholders.

Python & Software Engineering

  • Write clean, modular, production-quality, and well-documented Python code.
  • Build and expose ML models as REST APIs using FastAPI or Flask.
  • Collaborate with MLOps/DevOps engineers to containerize (Docker) and deploy models in cloud environments.
  • Follow best practices in version control (Git), testing, and CI/CD pipelines.

Data & Analytics

  • Perform Exploratory Data Analysis (EDA) on structured and unstructured datasets to identify patterns, trends, and anomalies.
  • Work with data from relational databases (SQL), data lakes, and cloud storage solutions.
  • Create compelling and clear data visualizations (Matplotlib, Seaborn, Plotly) to communicate findings.

Job Classification

Industry: IT Services & Consulting
Functional Area / Department: Data Science & Analytics
Role Category: Data Science & Machine Learning
Role: Data Scientist
Employment Type: Full time

Contact Details:

Company: Coforge
Location(s): Hyderabad

Keyskills: Anomaly Detection, GenAI, LangChain, LLM, Generative Artificial Intelligence, Python

Salary: ₹ 15-25 Lacs P.A.

Coforge

Coforge is a leading global IT solutions organization, enabling its clients to transform at the intersect of unparalleled domain expertise and emerging technologies to achieve real-world business impact. A focus on very select industries, a detailed understanding of the underlying processes of those...
