Job Description
Summary
As Head of AI (R&D), you will lead cross-functional teams to drive innovation in training multimodal large language models (LLMs), video foundation models, and other advanced AI systems. These models will be optimized for both cloud-based deployments and edge computing environments, ensuring scalability, efficiency, and real-world applicability. You will report directly to senior leadership and play a pivotal role in shaping the company's AI strategy, from research breakthroughs to production-ready solutions.
Key Responsibilities
- Lead and mentor interdisciplinary teams of researchers, engineers, and data scientists to pioneer cutting-edge AI research and development.
 - Innovate on model architectures, training methodologies, and optimization techniques for multimodal LLMs (integrating text, image, audio, and video) and specialized video foundation models.
 - Scale the R&D team by recruiting top talent, fostering a collaborative culture, and implementing processes for efficient growth and knowledge sharing.
 - Oversee scalable training pipelines, including data curation, distributed computing strategies (e.g., using HPC clusters or cloud resources), and hyperparameter optimization to handle massive datasets.
 - Deliver high-quality, general-purpose models tailored to specific applications and use cases, ensuring they meet performance benchmarks for accuracy, latency, and resource efficiency on cloud and edge devices.
 - Collaborate with product, engineering, and deployment teams to transition research prototypes into production systems.
 - Stay abreast of industry trends, publish research findings, and represent the company at conferences or in partnerships.
 - Manage budgets, timelines, and resources for R&D projects, ensuring alignment with business objectives and pragmatic AI practices.
 
Job requirements
- Proven experience leading AI R&D teams (5+ years in a leadership role) with a track record of delivering production-grade models from concept to deployment.
 - PhD in Computer Science, Machine Learning, Artificial Intelligence, or a related field.
 - Deep expertise in training foundation models, including multimodal LLMs and video models, with hands-on experience in frameworks like PyTorch, TensorFlow, or JAX.
 - Strong understanding of scalable training techniques, such as data parallelism, model parallelism, and efficient inference on edge devices (e.g., mobile, IoT).
 - Demonstrated ability to scale teams and projects in fast-paced environments.
 
Preferred
- Publications in top-tier conferences/journals (e.g., NeurIPS, CVPR, ICML) on relevant topics.
 - Experience with real-world deployments of AI models in datacenter and edge settings.
 - Familiarity with ethical AI, bias mitigation, and regulatory compliance in model development.
 - Strong programming skills in Python and experience with distributed systems (e.g., Kubernetes, Ray).
 
Skills
- Compliance Knowledge
 - Development
 - Machine Learning
 - Python
 

