Job Description
Are you ready to define the future of artificial intelligence?
Nebula AI Systems is at the forefront of developing next-generation Large Language Models (LLMs) designed to revolutionize enterprise productivity by 2026. We are seeking a visionary Senior AI Engineer to join our elite R&D team in San Francisco.
In this role, you won't just be maintaining models; you will design the architectures of tomorrow. You will work directly with our Chief Scientists to optimize model inference, fine-tune models on proprietary datasets, and deploy scalable AI solutions that solve complex, real-world problems.
If you are passionate about pushing the boundaries of Generative AI and have a knack for solving high-dimensional data challenges, we want to meet you.
Responsibilities
- Model Architecture: Design and implement cutting-edge neural network architectures for Large Language Models, focusing on efficiency and accuracy.
- Performance Optimization: Conduct rigorous performance profiling to reduce latency and increase throughput in real-time inference environments.
- Deployment: Oversee the end-to-end deployment of AI models using containerization (Docker/Kubernetes) and MLOps best practices on cloud infrastructure.
- Research & Development: Stay ahead of the curve with the latest advancements in NLP, Transformers, and Reinforcement Learning from Human Feedback (RLHF).
- Collaboration: Partner with product managers and software engineers to integrate AI capabilities seamlessly into production applications.
- Mentorship: Guide junior engineers and data scientists, fostering a culture of innovation and technical excellence.
Qualifications
- Education: Master's or PhD in Computer Science, Machine Learning, or a related field.
- Experience: 5+ years of professional experience in AI/ML engineering, with at least 2 years specifically focused on LLMs or Generative AI.
- Programming: Proficiency in Python and in PyTorch or TensorFlow. Solid understanding of C++ for performance-critical components.
- Mathematics: Strong grasp of linear algebra, calculus, probability, and statistics.
- Tools: Experience with Hugging Face, LangChain, MLflow, and AWS SageMaker or Google Vertex AI.
- Soft Skills: Excellent problem-solving skills and the ability to communicate complex technical concepts to non-technical stakeholders.