AI Inference Engineer Job at Signify Technology, Santa Clara, CA

  • Signify Technology
  • Santa Clara, CA

Job Description

AI Inference Engineer – Stealth Startup | San Francisco Onsite

Compensation: $200K–$300K + equity

Join a stealth-stage team backed by prominent academic research and successful technical founders, working at the bleeding edge of AI infrastructure. As generative AI continues to scale rapidly, the bottleneck is no longer training—it’s inference. This team is rebuilding the core systems that power inference, from kernel-level GPU optimizations to full-stack distributed deployment.

This role is ideal for engineers who want to go deep: working on quantization, KV caching, attention mechanisms like FlashAttention, and designing new strategies for parallelism across heterogeneous compute. You'll contribute to an integrated software-hardware stack that enables large-scale model deployment with dramatically improved performance, efficiency, and quality—at production scale.

What You’ll Be Doing:

  • Research and implement state-of-the-art techniques to improve AI model inference speed and quality
  • Architect and optimize distributed AI infrastructure across both GPU kernel and software layers
  • Profile, benchmark, and debug system performance across varied hardware environments
  • Drive improvements in model execution through compiler-level tuning, caching, and runtime strategies

What They’re Looking For:

  • Bachelor's degree in Computer Science, Engineering, Applied Math, or a related field
  • Strong experience with performance optimization and systems-level thinking
  • Proficiency in Python, C++, and CUDA
  • Familiarity with AI frameworks like PyTorch, TensorFlow, ONNX, or vLLM

Nice to Have:

  • Graduate degree in a technical field
  • Experience with MLIR or other compiler frameworks
  • Hands-on work with large-scale GPU infrastructure or custom kernels

This is a hands-on, foundational role in a fast-moving environment, offering the chance to shape the backbone of the next generation of AI systems.
