Machine Learning Engineer
Apply NowJob details
An exceptional opportunity to join an innovative, high-growth organisation shaping the future of AI-powered automation and digital interaction. We’re seeking a Machine Learning Engineer with full-stack development experience to work on cutting-edge projects involving Generative AI , Retrieval-Augmented Generation (RAG) , and multi-agent reasoning frameworks . This is a hands-on, end-to-end engineering role with impact across the full ML lifecycle – from experimentation to deployment. Conversational AI & Reasoning: Design, fine-tune, and deploy advanced LLMs with agentic capabilities RAG Pipelines: Build and optimise scalable pipelines for structured and unstructured data retrieval LLM Training & Fine-Tuning: Use methods like LoRA, QLoRA, SFT, PEFT, and RLHF Inference & Acceleration: Serve models using vLLM, DeepSpeed, Triton, TensorRT Multi-Agent Orchestration: Work with LangChain, AutoGen, CrewAI, DSPy and similar tools Cloud & MLOps (AWS): Deploy with SageMaker, Bedrock, Lambda, S3, ECS, EKS Full-Stack Integration: Build APIs (FastAPI, Flask) and integrate with React, TypeScript, Node.js Vector Search: Use FAISS, Weaviate, Pinecone, ChromaDB, OpenSearch Required skills & experience: 3–5 years of experience in ML engineering and software development Deep Python proficiency, with PyTorch, TensorFlow or Hugging Face Proven experience with LLMs, RAG, and deploying cloud-native AI on AWS Strong full-stack skills (React, TypeScript, Node.js) and API development Familiarity with vector databases and multi-agent frameworks Apply now to join this high growth and award-winning organisation with the opportunity to be part of building the future of AI driven projects and solutions. The role offers a highly competitive salary and benefits package and will be office based in London.
Apply Now