Highlights
Join our innovative team. Design compiler frameworks. Optimize AI models. Scalable infrastructures. High-intensity research environment.
Description
Are you a fresh graduate with a keen interest in AI and Machine Learning? Join our cutting-edge research team as an AI & ML Engineer/Researcher. We are looking for talented individuals to design, develop, and optimize compiler frameworks that will enhance the execution of AI models across various levels. Our team is dedicated to creating scalable transformer-based infrastructures for distributed multi-node training and efficient inference.
Responsibilities:
- Design and develop advanced compiler frameworks to optimize AI model execution at the kernel, graph, and operator levels.
- Architect scalable transformer-based infrastructures for distributed multi-node training and efficient inference.
- Create end-to-end AI pipelines that include graph optimizations, memory scheduling, and compute distribution.
- Collaborate with research teams to translate mathematical models into optimized execution graphs and intermediate representations (IRs).
- Implement custom kernels, quantization strategies, and low-level performance optimizations in C/C++ and CUDA.
- Analyze and tune runtime performance bottlenecks by focusing on parallelization, vectorization, and memory management.
- Develop domain-specific compiler passes for tensor operations, automatic differentiation, and operator fusion.
- Conduct systematic experiments to explore scaling laws, precision formats, and architectural optimizations for improved computational efficiency.
Mandatory Skills:
- Strong proficiency in C/C++ or Java with excellent command over pointers, memory management, performance optimization, and systems-level programming.
- Solid foundation in Mathematics, including calculus, probability, statistics, and linear algebra.
- Strong logical reasoning, problem-solving ability, high intelligence (IQ), and an analytical mindset.
- Ability to work effectively in a research-driven, high-intensity environment with cross-functional collaboration.
Preferred Skills:
- Experience or strong interest in compiler construction, runtime systems, and code generation.
- Proficiency in CUDA and Rust for high-performance and systems-level development.
- Deep understanding of computer architecture, operating systems, data structures, and memory management.
Skills