Essential Duties and Responsibilities
- Research, develop, and productize state-of-the-art AI models and algorithms using techniques such multi-modal and sequence-based transformer-based architectures, and other emerging technologies.
- Work with diverse data types, including video, images, and text, to develop and optimize AI models for applications such as text-to-video, and video-to-text, segmentation, feature extraction, and machine translation.
- Design, implement, and evaluate novel algorithms and large-scale deep learning models to address complex problems in your area of assignment, optimizing for performance, scalability, and user experience across diverse domains.
- Collaborate with software engineers to deploy scalable, real-time AI systems in the cloud and in-premise, optimizing for efficiency, latency, and memory usage.
- Create and manage efficient data preprocessing pipelines for generative AI tasks, proactively identifying and integrating new data sources.
- Write software to integrate AI code into a production system.
- Maintain a deep understanding of current research, technologies, and emerging trends in AI to inform and guide the development of cutting-edge solutions.
- Enhance our model serving runtime for efficient serving of natural language models in both streaming and batch applications.
- Other duties as assigned.
Supervisory Responsibility
This position has no direct supervisory responsibilities but does serve as a coach and mentor for other positions in the department.
Education
4-Year bachelor's degree in computer science, Computer Engineering, Physics, Mathematics, or a related field.
Preferred Master or PhD degree in a related field. Publications in top conferences such as CVPR, ACL, NeurIPS, ICML, or ICLR are a plus.
Experience
Minimum of 9 Years of experience in an AI field (e.g. deep learning, machine learning, computer vision, natural language processing, or automated speech recognition). A master's degree may be considered equivalent to 2-4 YOE. A Phd may be considered equivalent to 1-3 YOE.
Knowledge, Skills, and Abilities
- Advanced skills in machine learning, deep learning, computer vision, and generative AI techniques, in areas such as transformer-based architectures, GANs, latent diffusion models, VAEs, cross-attention, and segmentation.
- Extensive experience with model optimization techniques for efficient inference in real-world applications.
- Strong desire for continuous learning and staying at the forefront of AI research and development.
- Extensive experience with large language models, multi-modal learning, and supervised and unsupervised learning.
- Advanced skills in programming languages like Python or C++, experience with data preprocessing and analysis tools such as NumPy and pandas, and excellent skills with deep learning frameworks such as PyTorch, TensorFlow, scikit-learn, Hugging Face Transformers, and Open AI API protocol.
- Proven experience with computer vision libraries and tools such as autoencoders, OpenCV, scikit-image, ffmpeg, and CUDA.
- Advanced skills with video, image, and text processing techniques as well as natural language processing concepts and methods.
- Proven experience with cloud computing services like AWS for training and deploying models at scale.
- Experience using optimized inference engines like OnnxRuntime or TensorRT and model serving runtimes like Triton Inference Server is desired.
- Strong understanding of software development processes for translating research into production systems and understanding of experimental design, data analysis, statistics, and data science principles.
- Familiarity of ASL communication skills.