Job Overview

We are looking for a Data Scientist with strong experience in Generative AI and Large Language Models (LLMs) to design, build, and deploy intelligent AI-powered applications. The ideal candidate should have hands-on expertise in LLM integration, Retrieval-Augmented Generation (RAG), prompt engineering, and AI model evaluation while ensuring safety, accuracy, and performance optimization.

The role involves working closely with data engineers, ML engineers, and product teams to develop scalable GenAI solutions deployed on cloud platforms such as Azure OpenAI .

Key Responsibilities

Generative AI & LLM Development

Design, build, and optimize LLM-based applications using platforms like OpenAI / Azure OpenAI and locally hosted models.

Develop and refine prompt engineering strategies and system prompts to improve response accuracy and reliability.

Implement Retrieval-Augmented Generation (RAG) pipelines to enable context-aware responses.

RAG & Knowledge Integration

Build scalable retrieval systems using techniques such as document chunking, embeddings, and semantic search .

Implement and manage vector databases such as FAISS or Pinecone for knowledge retrieval.

Integrate structured and unstructured data sources into GenAI pipelines.

Frameworks & AI Development

Develop AI workflows using frameworks like LangChain and LlamaIndex .

Build modular pipelines to support conversational AI, document intelligence, and knowledge assistants.

Model Evaluation & Experimentation

Evaluate LLM outputs using quantitative metrics such as BLEU, BERTScore, and other benchmarking techniques.

Implement human-in-the-loop evaluation frameworks to improve model quality.

Track experiments and model performance using MLflow or similar tools .

AI Safety & Responsible AI

Implement guardrails and safety mechanisms to handle toxicity, bias, and harmful outputs.

Detect and manage PII (Personally Identifiable Information) in AI responses.

Apply techniques for grounding, citation, and hallucination mitigation to ensure reliable AI outputs.

Deployment & Optimization

Deploy scalable GenAI services on Azure, particularly using Azure OpenAI (AOAI) .

Optimize latency, cost, and inference performance for production systems.

Monitor system performance and continuously improve reliability.

Required Skills & Technologies

Programming & AI

Python

NLP libraries such as spaCy, Hugging Face Transformers

Data processing and model experimentation

GenAI & LLM Technologies

OpenAI / Azure OpenAI APIs

Prompt engineering and system prompt design

Retrieval-Augmented Generation (RAG)

Frameworks

LangChain

LlamaIndex

Vector Databases

FAISS

Pinecone

Evaluation & Experiment Tracking

BLEU, BERTScore

MLflow or similar experiment tracking tools

Cloud & Deployment

Azure

Azure OpenAI services

Model deployment and monitoring

Preferred Qualifications

Experience building enterprise GenAI applications such as chatbots, document assistants, or knowledge search platforms.

Understanding of MLOps and LLMOps practices .

Experience with fine-tuning or parameter-efficient tuning techniques .

Familiarity with data governance, AI ethics, and responsible AI frameworks .

Data Scientist