Job Description
Summary
As Head of AI (R&D), you will lead cross-functional teams to drive innovation in training multimodal large language models (LLMs), video foundation models, and other advanced AI systems. These models will be optimized for both cloud-based deployments and edge computing environments, ensuring scalability, efficiency, and real-world applicability. You will report directly to senior leadership and play a pivotal role in shaping the company's AI strategy, from research breakthroughs to production-ready solutions.
Key Responsibilities
- Lead and mentor interdisciplinary teams of researchers, engineers, and data scientists to pioneer cutting-edge AI research and development.
- Innovate on model architectures, training methodologies, and optimization techniques for multimodal LLMs (integrating text, image, audio, and video) and specialized video foundation models.
- Scale the R&D team by recruiting top talent, fostering a collaborative culture, and implementing processes for efficient growth and knowledge sharing.
- Oversee scalable training pipelines, including data curation, distributed computing strategies (e.g., using HPC clusters or cloud resources), and hyperparameter optimization to handle massive datasets.
- Deliver high-quality, general-purpose models tailored to specific applications and use cases, ensuring they meet performance benchmarks for accuracy, latency, and resource efficiency on cloud and edge devices.
- Collaborate with product, engineering, and deployment teams to transition research prototypes into production systems.
- Stay abreast of industry trends, publish research findings, and represent the company at conferences or in partnerships.
- Manage budgets, timelines, and resources for R&D projects, ensuring alignment with business objectives and pragmatic AI practices.
Job requirements
- Proven experience leading AI R&D teams (5+ years in a leadership role) with a track record of delivering production-grade models from concept to deployment.
- PhD in Computer Science, Machine Learning, Artificial Intelligence, or a related field.
- Deep expertise in training foundation models, including multimodal LLMs and video models, with hands-on experience in frameworks like PyTorch, TensorFlow, or JAX.
- Strong understanding of scalable training techniques, such as data parallelism, model parallelism, and efficient inference on edge devices (e.g., mobile, IoT).
- Demonstrated ability to scale teams and projects in fast-paced environments.
Preferred
- Publications in top-tier conferences/journals (e.g., NeurIPS, CVPR, ICML) on relevant topics.
- Experience with real-world deployments of AI models in datacenter and edge settings.
- Familiarity with ethical AI, bias mitigation, and regulatory compliance in model development.
- Strong programming skills in Python and experience with distributed systems (e.g., Kubernetes, Ray).
Skills
- Compliance Knowledge
- Development
- Machine Learning
- Python