Beaverhand

Interview While You Work. Hire While You Sleep.

Wafer

Member Of Technical Staff (Summer Intern)

San Francisco, CA
Mid-level (2-5 years) Level
Posted 6/15/2026

About this role

Join our team to build the future of inference, GPU optimization and AI infrastructure. You'll work directly with the team to define our technical direction and build the core systems that power our GPU optimization platform. What You'll Do Build scalable infrastructure for AI model training and inference Lead technical decisions and architecture choices What We Look For Core Technical Expertise GPU Fundamentals: Deep understanding of GPU architectures, CUDA programming, and parallel computing patterns. Deep Learning Frameworks: Proficiency in PyTorch, TensorFlow, or JAX, particularly for GPU-accelerated workloads. LLM/AI Knowledge: Strong grounding in large language models (training, fine-tuning, prompting, evaluation). Systems Engineering: Proficiency in C++, Python, and possibly Rust/Go for building tooling around CUDA.

Most Desired Skills

Apply for this position

Step 1 of 4Basic Info & Resume

Let's start with the essentials

Drag and drop your resume here, or

PDF, DOC, or DOCX (Max 10MB)

Powered by Beaverhand

By submitting this application, you agree to our Terms of Service and Privacy Policy