Integration

Machine Learning Engineer

Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.

The Role

- 4+ years of ML engineering in production (not just research or notebooks).

- Hands-on LLM experience in 2025-2026: agentic systems, tool-use, function-calling, RAG, structured output, eval design.

- Strong Python. Comfortable with PyTorch/JAX and one serving stack (vLLM, TGI, TensorRT-LLM, SageMaker, or similar).

- You've built an eval pipeline that actually caught a regression in prod.

- You read the papers and know which ones to ignore.

What You'll Do

- Design and ship the ML backbone of Gini AI Workers - routing, tool selection, reasoning, memory, evaluation.

- Build evaluation and feedback loops - offline evals, online A/B, regression harnesses, human-in-the-loop labeling pipelines.

- Optimize cost and latency across the agent stack: prompt engineering, model routing (frontier ↔ small ↔ fine-tuned), caching, speculative decoding, distillation.

- Fine-tune and/or RAG-tune models for vertical enterprise tasks (invoice extraction, PO matching, ticket triage, forecasting).

- Own the ML infra - training pipelines, experiment tracking, model registry, deployment, monitoring, drift detection.

- Partner with backend + product to turn research into shipped features on a weekly cadence.

What You Bring

- 4+ years of ML engineering in production (not just research or notebooks).

- Hands-on LLM experience in 2025-2026: agentic systems, tool-use, function-calling, RAG, structured output, eval design.

- Strong Python. Comfortable with PyTorch/JAX and one serving stack (vLLM, TGI, TensorRT-LLM, SageMaker, or similar).

- You've built an eval pipeline that actually caught a regression in prod.

- You read the papers and know which ones to ignore.

Nice to Have

- Experience with MCP, LangGraph, DSPy, or custom agent frameworks.

- Fine-tuning (LoRA/QLoRA, DPO/ORPO, RLAIF) on open-weight models (Llama, Qwen, Mistral, DeepSeek).

- Vector DBs (pgvector, Pinecone, Weaviate, Qdrant), reranking, hybrid retrieval.

- Prior work on multi-agent systems or enterprise copilots.

Why This Role Matters

[{"_id": "9f1b2c5a-6e70-9d44-be0f-b832d4ff9a43", "tag": "p", "data": {}, "type": "Paragraph", "classes": [], "children": ["9f1b2c5a-6e70-9d44-be0f-b832d4ff9a44"]}, {"v": "We're shipping AI Workers to 100+ enterprises. Your models don't just answer questions - they post invoices, reconcile accounts, and route tickets. The bar for reliability is enterprise, not consumer. That's the fun part.", "_id": "9f1b2c5a-6e70-9d44-be0f-b832d4ff9a44", "text": true}]

Book a Demo

Fill out the form or click on the tab to book a call directly with one of our solutions experts.
Black outlined hexagons arranged in a honeycomb pattern on a white background with subtle blue highlights.
By submitting this form, you agree to our Privacy Policy
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.