Job Description

Location: Remote

Duration: 2–4 months (project-based)

Type: Contract / Research Collaboration (Paid)

About the Project

We are looking for a Master’s or PhD student to work on fine-tuning large language models (LLMs) for domain-specific tasks. The goal is to take an existing pretrained model (e.g., Meta AI’s LLaMA-class models or similar) and specialize it for a narrow, high-value use case using efficient fine-tuning techniques.

This is a hands-on applied project designed for someone who wants real-world experience deploying and optimising LLM systems.

Help drive the next wave of applied AI by demonstrating how fine-tuned LLMs can unlock advanced, real-world use cases beyond general-purpose foundation models. Organizations that require domain-specific accuracy, self-hosted deployments, customisable workflows, or performance beyond out-of-the-box capabilities increasingly rely on fine-tuned models to meet those needs.

Through this project, you will contribute to building specialised AI systems that deliver improved accuracy, efficiency, and control compared to out-of-the-box models. You will also help bridge the gap between academic knowledge and real-world application by applying fine-tuning techniques to solve concrete business problems.

What You’ll Work On

�� Fine-tuning pre-trained LLMs on small to medium datasets (500–20k examples)
�� Implementing parameter-efficient fine-tuning (e.g., LoRA-style methods)
�� Optimising training for cost and performance
�� Running experiments on GPU cloud infrastructure
�� Evaluating model performance and tradeoffs (specialisation vs generalisation)
�� Deploying fine-tuned models for inference

Experience

�� Strong Python skills
�� Experience with deep learning frameworks: PyTorch (preferred) or TensorFlow
�� Experience with Hugging Face Transformers or similar ecosystems
�� Hands-on experience training or fine-tuning transformer models on GPUs (local or cloud-based)
�� Previous experience using cloud platforms for model training or deployment (e.g., AWS, GCP, Azure, RunPod or similar GPU providers)
�� Experience working with or fine-tuning open-weight LLM families (Gemma-3, Qwen-3.5, Llama 4, GPT-OSS, Mistral...)
�� Hands-on experience with LoRA

Understanding of:

�� Fine-tuning vs pretraining
�� Overfitting and generalization
�� Model evaluation
�� Strong business awareness: ability to understand the context of the fine-tuning task and translate domain requirements into clear modeling objectives

What you bring

�� MSc or PhD student in Computer Science, Machine Learning, AI, or related field
�� Alternatively, 6 months of hands-on experience training and fine-tuning deep learning models
�� Has worked on LLMs in research or industry
�� Has fine-tuned at least one transformer model
�� Comfortable working independently
�� Interested in applied AI and real-world constraints (cost, latency, memory)

What You’ll Gain

�� Real-world experience fine-tuning large models (30B–100B parameter class)
�� Exposure to production constraints and deployment
�� Opportunity to co-author technical writeups if applicable
�� Strong applied portfolio project

What We Offer

�� 100% Remote Work: Work from anywhere with flexibility and autonomy
�� Dynamic, High-Impact Projects: Work on cutting-edge ML and GenAI solutions across diverse industries
�� International Clients: Collaborate with global organizations and solve real-world challenges at scale
�� Urban Sports Club Membership: Supporting your physical and mental wellbeing
�� Monthly Bolt Credits: For rides
�� Company Events & Offsites: Regular team gatherings to connect, collaborate, and celebrate
Originally posted on Himalayas

AI Researcher

Job Description

Understanding of:

More Remote Jobs

Sign in to apply