Home Job Details
N
Information Technology 🏒 Full Time ⭐️ Verified

Senior Generative AI Engineer

Nexus Future Labs
San Francisco
Estimated Salary
USD 180.000 – USD 250.000
New
Live Update
2 Juli 2026
Deadline
2 Jul 2027

Job Description

Are you ready to architect the next generation of intelligent systems? Nexus Future Labs is seeking a visionary Senior Generative AI Engineer to lead our research and deployment of cutting-edge Large Language Models (LLMs) and autonomous agents.

We are building the technology stack for the year 2026 and beyond. You will work on optimizing transformer architectures, implementing Retrieval-Augmented Generation (RAG) pipelines, and ensuring ethical AI deployment at scale.

Why join us?

  • Work on state-of-the-art AI models in a high-impact environment.
  • Competitive compensation and equity packages.
  • Flexible remote-first culture with hubs in San Francisco and New York.

Key Responsibilities:

Lead the end-to-end lifecycle of AI model development, from data curation to production deployment.

Optimize inference latency and throughput for large-scale transformer models.

Collaborate with cross-functional teams to integrate AI capabilities into consumer applications.

Establish best practices for model monitoring, logging, and observability.

Contribute to the open-source community and internal research publications.

Responsibilities

  • Design, train, and fine-tune proprietary LLMs using PyTorch and TensorFlow.
  • Architect scalable inference pipelines to handle high-throughput production traffic.
  • Implement and evaluate novel techniques in prompt engineering and reinforcement learning from human feedback (RLHF).
  • Collaborate with product teams to integrate AI capabilities into consumer-facing applications.
  • Ensure data privacy, security, and bias mitigation in all AI models.

Qualifications

  • PhD or Master’s degree in Computer Science, Mathematics, or a related field.
  • 5+ years of professional experience in Machine Learning or Deep Learning engineering.
  • Extensive experience with Python, C++, and GPU acceleration (CUDA).
  • Strong understanding of NLP, Transformer models, and distributed systems.
  • Experience deploying models via Kubernetes and cloud platforms (AWS/GCP).

Required Skills

Python Machine Learning Deep Learning Large Language Models Natural Language Processing PyTorch TensorFlow Kubernetes AWS Generative AI RAG

Ready to Take This Challenge?

Make sure your resume is ready. Submit your application now before the deadline.

Apply Now

Related Jobs

Similar job recommendations for you

View All