Tenstorrent logo

Staff Machine Learning Engineer, Model Training

Tenstorrent
Full-time
On-site
Belgrade, Serbia
Software Engineer

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must evolve to unify innovations in software models, compilers, platforms, networking, and semiconductors. Our diverse team of technologists have developed a high performance RISC-V CPU from scratch, and share a passion for AI and a deep desire to build the best AI platform possible. We value collaboration, curiosity, and a commitment to solving hard problems. We are growing our team and looking for contributors of all seniorities.

 
We’re looking for a technical leader to join our team in Belgrade, Serbia. This role is about owning the technical vision for our model training stack, architecting core compiler infrastructure, and leading a team of exceptional engineers by example. This is a hybrid role where you'll have a direct impact on the future of our hardware and software.
 
This role is hybrid based in Belgrade, Serbia.
 
We welcome candidates at various experience levels. During the interview process, candidates will be assessed for the appropriate level, and offers will align with that level, which may differ from the one in this posting.
 
Who You Are
  • A proven leader with 5+ years of experience driving technical roadmaps and mentoring engineering teams in the ML industry.
  • An expert in neural networks, with extensive, hands-on experience building and scaling distributed training systems.
  • Fluent in Python and C++, with deep knowledge of modern AI frameworks like PyTorch or JAX. Experience with compiler infrastructure or custom hardware is a major plus.
What We Need
  • Technical ownership of the training roadmap for our compiler, driving strategic planning and defining priorities for the team.
  • Scaling out to various modes of distributed model training with architectural oversight and performance optimization.
  • Benchmarking, analysis, and optimization of training performance across Tenstorrent's hardware and software stack.
  • Leadership through mentorship and by driving critical cross-team initiatives to ship reliable, scalable software.
You'll Learn How To
  • Architect and implement compiler optimizations specifically for training workloads on our custom hardware.
  • Identify bottlenecks, debug performance issues, and improve developer workflows across our stack.
  • Adapt and evolve our entire software infrastructure and leverage our hardware capabilities to stay ahead of emerging AI models.

 

Tenstorrent offers a highly competitive compensation package and benefits, and we are an equal opportunity employer.

This offer of employment is contingent upon the applicant being eligible to access U.S. export-controlled technology.  Due to U.S. export laws, including those codified in the U.S. Export Administration Regulations (EAR), the Company is required to ensure compliance with these laws when transferring technology to nationals of certain countries (such as EAR Country Groups D:1, E1, and E2).   These requirements apply to persons located in the U.S. and all countries outside the U.S.  As the position offered will have direct and/or indirect access to information, systems, or technologies subject to these laws, the offer may be contingent upon your citizenship/permanent residency status or ability to obtain prior license approval from the U.S. Commerce Department or applicable federal agency.  If employment is not possible due to U.S. export laws, any offer of employment will be rescinded.