Uoffer Logo
Uoffer_logo
Applied AI Research Engineering Intern - Fall 2025
NVIDIA
Intern
United States
公司规模:--
2025-07-02 15天前
2025-07-02 15天前
职位描述
What you'll be doing: 

Collaborate on the design and development of the Dynamo Kubernetes stack. 

Introduce new features to the Dynamo Python SDK and Dynamo Rust Runtime Core Library; design, implement, and optimize distributed inference components in Rust and Python. 

Contribute to the development of disaggregated serving for Dynamo-supported inference engines (vLLM, SGLang, TRT-LLM, llama.cpp, mistral.rs). 

Improve intelligent routing and KV-cache management subsystems. 

Contribute to open-source repositories, participate in code reviews, assist with issue triage on GitHub, work closely with the community to address issues, capture feedback, and evolve the framework’s APIs and architecture. 

What We Need To See: 

Pursuing Bachelors or Masters in Computer Science or a related field 

Excellent Golang, Rust and/or Python programming and software design skills, including debugging, performance and service health analysis, and test design 

Good understanding of algorithms and data structures, solid knowledge of RESTful APIs 

Highly motivated, dedicated, and curious about new technologies. You take pride in your work and strive to achieve incredible results and possess excellent communication, planning, and problem solving skills.
2025北美Summer + Fall 实训机会
立即抢占
2025北美Summer + Fall 实训机会
立即抢占