Job Description
We are building the operating system of tomorrow. Project 2026 is our ambitious initiative to redefine how autonomous agents interact with enterprise data.
Join a team of elite engineers pushing the boundaries of Generative AI, LLM orchestration, and predictive analytics. You will architect the core infrastructure that will power our next-generation intelligent systems.
Why Join Us?
We are not just building software; we are shaping the future of human-computer interaction for the year 2026 and beyond.
Responsibilities
- Architect LLM Pipelines: Design and deploy scalable inference systems for large language models, optimizing for latency and throughput.
- Autonomous Agent Development: Build and refine autonomous agents capable of complex reasoning, tool use, and multi-step planning.
- Model Fine-Tuning: Collaborate with data science leads to fine-tune open-source models (Llama, Mistral) for specific domain expertise.
- Vector Database Optimization: Implement and manage high-performance vector similarity search for RAG (Retrieval-Augmented Generation) applications.
- Security & Compliance: Ensure all AI models adhere to strict data privacy regulations and implement robust prompt injection defenses.
- Performance Engineering: Monitor system health, reduce token costs, and improve inference speeds.
Qualifications
- Experience: 5+ years of professional software engineering experience, with a strong focus on Machine Learning or AI.
- Programming: Proficiency in Python, PyTorch, or TensorFlow. Experience with Rust or Go is a plus.
- LLM Knowledge: Deep understanding of Transformer architectures, Attention mechanisms, and current state-of-the-art generative models.
- Data Skills: Strong background in data structures, algorithms, and distributed systems.
- Education: BS, MS, or PhD in Computer Science, Mathematics, or a related technical field.
- Communication: Ability to translate complex technical concepts into clear, actionable insights for non-technical stakeholders.