Job Description
Are you ready to architect the next generation of intelligent systems? Nexus Future Labs is seeking a visionary Senior Generative AI Engineer to lead our research and deployment of cutting-edge Large Language Models (LLMs) and autonomous agents.
We are building the technology stack for the year 2026 and beyond. You will work on optimizing transformer architectures, implementing Retrieval-Augmented Generation (RAG) pipelines, and ensuring ethical AI deployment at scale.
Why join us?
- Work on state-of-the-art AI models in a high-impact environment.
- Competitive compensation and equity packages.
- Flexible remote-first culture with hubs in San Francisco and New York.
Key Responsibilities:
Lead the end-to-end lifecycle of AI model development, from data curation to production deployment.
Optimize inference latency and throughput for large-scale transformer models.
Collaborate with cross-functional teams to integrate AI capabilities into consumer applications.
Establish best practices for model monitoring, logging, and observability.
Contribute to the open-source community and internal research publications.
Responsibilities
- Design, train, and fine-tune proprietary LLMs using PyTorch and TensorFlow.
- Architect scalable inference pipelines to handle high-throughput production traffic.
- Implement and evaluate novel techniques in prompt engineering and reinforcement learning from human feedback (RLHF).
- Collaborate with product teams to integrate AI capabilities into consumer-facing applications.
- Ensure data privacy, security, and bias mitigation in all AI models.
Qualifications
- PhD or Masterβs degree in Computer Science, Mathematics, or a related field.
- 5+ years of professional experience in Machine Learning or Deep Learning engineering.
- Extensive experience with Python, C++, and GPU acceleration (CUDA).
- Strong understanding of NLP, Transformer models, and distributed systems.
- Experience deploying models via Kubernetes and cloud platforms (AWS/GCP).