FuriosaAI Logo

FuriosaAI

AI Software Engineer (Platform Software)

Posted 16 Days Ago
In-Office or Remote
9 Locations
Junior
In-Office or Remote
9 Locations
Junior
Develop and optimize AI models in PyTorch for NPU architecture, analyze existing frameworks, and collaborate with the compiler team to enhance performance.
The summary above was generated by AI

About the Job
  • FuriosaAI is looking for passionate AI Software Engineers to join our Platform Team. You will participate in the research and development of models optimized for our NPU accelerator.

  • Our team builds the production-grade, streamlined AI software that makes up our SDK. This includes the runtime, LLM serving framework, and PyTorch models/extensions.

  • Your work on these critical parts of the SDK will directly enable AI developers to efficiently deploy optimized AI models on FuriosaAI NPUs.

Responsibilities
  • Develop and optimize DNN model implementations in PyTorch for FuriosaAI's Tensor Contraction Processor (TCP) architecture

  • Analyze the features, implementations, CUDA and Triton kernels of existing AI model inference frameworks such as vLLM, TensorRT-LLM, and DeepSpeed-MII

  • Research and implement generative AI models, parallelism strategies, and inference techniques to improve performance and efficiency

  • Collaborate closely with the compiler team to optimize and enable models.

Minimum Qualifications
  • BS degree in Computer Science, Engineering, or a related field, or equivalent industry experience

  • Proficiency in Python programming skill

  • Experience in developing AI models in DNN frameworks (e.g., PyTorch)

  • Solid understanding of machine learning, deep learning, natural language processing (NLP), and/or generative AI models

  • Strong communication skills with the ability to collaborate effectively across cross-functional teams

Preferred Qualifications
  • Hands-on experience with PyTorch 2.0 technologies (e.g., TorchDynamo) or DNN compiler technologies, such as Triton and MLIR

  • Proficiency in C++/CUDA or Rust programming skills

  • Hands-on experience deploying and optimizing large-scale ML models in production

  • Hands-on experience in model training and fine-turning of pre-trained models

  • Experience in LLM inference frameworks: vLLM, TensorRT-LLM, and DeepSpeed-MII

  • Strong background in model quantizations and model evaluations

  • Strong background in machine learning, generative AI, and model evaluation techniques

  • Proven track record of contributing to open-source projects

Contact

Top Skills

C++
Cuda
Mlir
Python
PyTorch
Rust
Triton

Similar Jobs

17 Hours Ago
Easy Apply
In-Office or Remote
3 Locations
Easy Apply
Senior level
Senior level
Artificial Intelligence • Cloud • eCommerce • Enterprise Web • Software • Design • Generative AI
As a Senior DevOps Engineer, you'll manage Webflow's build systems, oversee development environments, enforce best practices, and mentor junior team members to enhance efficiency and delivery in software development.
Top Skills: AWSCi/CdDockerInfrastructure As Code (Iac)KubernetesNode.jsPulumi
17 Hours Ago
Easy Apply
In-Office or Remote
3 Locations
Easy Apply
Senior level
Senior level
Artificial Intelligence • Cloud • eCommerce • Enterprise Web • Software • Design • Generative AI
The Staff Data Scientist will lead projects to drive insights, define business metrics, advance experimentation, and collaborate with various teams to enhance user experience and product performance.
Top Skills: PythonRSQL
17 Hours Ago
Easy Apply
In-Office or Remote
3 Locations
Easy Apply
Senior level
Senior level
Artificial Intelligence • Cloud • eCommerce • Enterprise Web • Software • Design • Generative AI
The Partner Manager will develop strategies for growing the reseller ecosystem, build relationships with partners, and work cross-functionally to drive mutual growth and success.
Top Skills: AIBusiness DevelopmentPartner MarketingSaaS

What you need to know about the Manchester Tech Scene

Home to a £5 billion digital ecosystem, including MediaCity, which consists of major players like the BBC, ITV and Ericsson, Manchester is one of the U.K.'s top digital tech hubs, at the forefront of advancements in film, television and emerging sectors like as e-sports, while also fostering a community of professionals dedicated to pushing creative and technological boundaries.

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account