See all roles

Senior Staff Software Engineer, High Performance Inference System

Work from home Full-time role Hiring

About Groq Groq delivers fast, efficient AI inference. Our LPU-based system powers GroqCloud™, giving businesses and developers the speed and scale they need. Headquartered in Silicon Valley, we are on a mission to make high performance AI compute more accessible and affordable. When real-time AI is within reach, anything is possible. Build fast. Senior Staff Software Engineer – High Performance Inference System Mission: Join the team that builds and operates Groq’s real-time, distributed inference system delivering large scale inference for LLMs and next-gen AI applications at ultra-low latency. Your work will optimize for heterogeneous hardware, dynamic global workloads, and extreme performance—all while running code at the edge of physics. Responsibilities & opportunities in this role:

  • Distributed Systems Engineering: Design and implement scalable, low-latency runtime systems that coordinate thousands of GroqChips across a software-scheduled interconnect.
  • Low-Level Optimization: Develop deterministic, hardware-aware abstractions that prioritize execution speed, fault tolerance, and reliability.
  • Performance & Diagnostics: Build tools and infrastructure to support real-time system observability, diagnostics, and SLO improvements.
  • Future-Proofing: Evolve Groq’s system stack to support emerging silicon, topologies, and heterogeneous accelerators (e.g., FPGAs).
  • Cross-Functional Collaboration: Partner with teams across compiler, infra, cloud, hardware, and data centers to align architecture and drive shared progress.

Ideal candidates have/are:

  • Consistently ship high-impact, production-ready systems code.
  • Have deep knowledge of computer architecture, operating systems, algorithms, and hardware-software interfaces.
  • Are fluent in low-level systems languages such as C++ or Rust, and comfortable with hardware-aware programming.
  • Rigorously profile and optimize for latency, throughput, and resource efficiency—every cycle counts.
  • Believe in automation and CI/CD best practices—you don’t ship untested code.
  • Thrive across the stack—from kernel internals to hardware integration to cloud load balancers.
  • Communicate clearly, make pragmatic technical decisions, and write maintainable code for the long term.
  • Ensures code stays fast, scales well, and takes ownership of outcomes.

Nice to have:

  • Operating large-scale distributed systems for real-time, high-traffic services.
  • Deploying and optimizing ML or HPC workloads in production environments.
  • Hands-on experience with GPUs, FPGAs, or ASICs in performance-critical systems.
  • Familiarity with ML frameworks (e.g., PyTorch) or compiler tools (e.g., MLIR).
  • Experience delivering complex projects in fast-paced, high-impact environments.

Attributes of a Groqster:

  • Humility - Egos are checked at the door
  • Collaborative & Team Savvy - We make up the smartest person in the room, together
  • Growth & Giver Mindset - Learn it all versus know it all, we share knowledge generously
  • Curious & Innovative - Take a creative approach to projects, problems, and design
  • Passion, Grit, & Boldness - no limit thinking, fueling informed risk taking

If this sounds like you, we’d love to hear from you! Compensation: At Groq, a competitive base salary is part of our comprehensive compensation package, which includes equity and benefits. For this role, the base salary range is $248,710 - $336,490, determined by your skills, qualifications, experience and internal benchmarks. Location: Some roles may require being located near or on our primary sites, as indicated in the job description. At Groq: Our goal is to hire and promote an exceptional workforce as diverse as the global populations we serve. Groq is an equal opportunity employer committed to diversity, inclusion, and belonging in all aspects of our organization. We value and celebrate diversity in thought, beliefs, talent, expression, and backgrounds. We know that our individual differences make us better. Groq is an Equal Opportunity Employer that is committed to inclusion and diversity. Qualified applicants will receive consideration for employment without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, disability or protected veteran status. We also take affirmative action to offer employment opportunities to minorities, women, individuals with disabilities, and protected veterans. Groq is committed to working with qualified individuals with physical or mental disabilities. Applicants who would like to contact us regarding the accessibility of our website or who need special assistance or a reasonable accommodation for any part of the application or hiring process may contact us at: [email protected]. This contact information is for accommodation requests only. Evaluation of requests for reasonable accommodations will be determined on a case-by-case basis. Apply tot his job

You might like

Android Senior Software Engineer, Mobile Platform

Work from home Full-time role

Staff Software Engineer

Work from home Full-time role

Embedded Systems Software Engineer

Work from home Full-time role

Sr./Pr. Software Engineer (TS pref) - Space Systems (Dulles) Job at Northrop Gru

Work from home Full-time role

Software Engineer III (Communications Platform)

Work from home Full-time role

Software Engineer for Training AI Data (JavaScript)

Work from home Full-time role

Software Engineer - Data Infrastructure

Work from home Full-time role

Senior Staff Software Engineer, Local Environments Team

Work from home Full-time role

Software Engineer, Infrastructure ( Platform DevX - Cloud Provisioning)

Work from home Full-time role

C++ Software Engineer, Remote Assist

Work from home Full-time role

Experienced Customer Service Representative – Remote Part-time Opportunities at arenaflex

Work from home Full-time role

Senior Software Engineer, Full-Stack

Work from home Full-time role

Amazon PPC Specialist | BAD Marketing | $12k-$30k | Remote (Worldwide)

Work from home Full-time role

Experienced Data Entry Specialist – Remote Online Typing Jobs with Flexible Hours and Competitive Compensation

Work from home Full-time role

IT Project Manager

Work from home Full-time role

Adobe Illustrator Tutor

Work from home Full-time role

Doordash Online Remote Jobs $25/Hour

Work from home Full-time role

Sustainability Reporting Specialist

Work from home Full-time role

Remote Data Entry Specialist – Flexible Hours – No Experience Required – Join arenaflex’s Global Entertainment Team

Work from home Full-time role

Experienced Sales Representative – Live Chat & Customer Support Specialist

Work from home Full-time role