Your Browser is Not Supported
To ensure jobstore run smoothly, please use the latest version of the following supported browsers:
At NVIDIA Lightspeed Studios we are passionate about pushing the boundaries of technology and are seeking a skilled Deep Learning Software Engineer to join our team. If you're enthusiastic about deep learning research in the area of real-time perception and gameplay control logic, content generation for games (2D/3D), animation, large or small language models, and have the technical background to apply deep learning solutions to real world problems, we want to hear from you!
Recent projects from our team:
RacerX: https://blogs.nvidia.com/blog/2022/09/29/racer-rtx-demo/
Avatar Cloud Engine for Games: https://www.nvidia.com/en-us/geforce/news/nvidia-ace-for-games-generative-ai-npcs/
What you'll be doing:
Keep up to date with the latest AI research in the area of real-time inference, generative content, animation and language models.
Apply existing research to real world problems in game development.
Collaborate with applied research teams to bring the latest AI technology to real products.
Play a pivotal role in steering applied research efforts to unlock innovative solutions and push the boundaries of what's possible in game development.
Use version control systems like Git, participate in code reviews, and support continuous integration processes and testing.
Collaborate on Linux or Windows environments.
What we need to see:
Strong experience with deep learning tools and APIs (PyTorch, CuDNN, TensorRT).
BS level education or equivalent experience in Computer Science or related field
4+ years of relevant work experience.
Strong proficiency in Python, Rust or C++, with a deep understanding of software design patterns and architecture.
Experience in optimizing neural networks for execution on CPU or GPU.
Experience developing modularized applications.
Proficiency in live debugging and performance profiling.
Background with 2D/3D content pipelines for games.
Understanding of large language models.
Knowledge of game engine development is preferred.
Prior exposure to any/all of the following: Unreal Engine 5, Unity Engine, Pixar’s OpenUSD and APIs or Digital Content Creation software.
With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most hard-working and dedicated people on the planet working for us and, due to unprecedented growth, our special engineering teams are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want to hear from you.
The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are now looking for a Senior Deep Learning Software Engineer!
At NVIDIA, we are pushing the boundaries of what is possible in AI. To advance the future of NVIDIA's hardware and software, we are seeking a talented Software Engineer who will deliver innovative quantization and sparsity algorithms into our flagship products. In this role, you will rigorously explore, implement, and evaluate new concepts, bridging the realms of hardware and software while setting new benchmarks in AI acceleration. Your work will play a crucial role in influencing our flagship GPU and software roadmaps, enabling major customer wins, and advancing our mission to fulfill the promise and future of AI.
What You'll Be Doing:
Implementing, optimizing, and benchmarking state-of-the-art sparsity and quantization algorithms on real-world AI models running on ground breaking hardware.
Collaborate closely with hardware, software, and research teams to assess and adopt deep learning algorithmic advancements in sparsity and quantization
Provide engineering support to customers using innovative HW and SW approaches
Work closely with production teams to develop and integrate the latest deep learning approaches into state-of-the-art libraries and frameworks
What We Need to See:
A minimum of a Master's degree in Computer Science, Artificial Intelligence, Applied Math or a related field, or equivalent experience
5+ years of relevant software development experience
A strong foundation in deep learning, specifically in quantization, numerics, and sparsity
Strong proficiency with modern frameworks such as PyTorch and TensorFlow
Familiarity with computer architecture and how it relates to AI algorithms development
Previous experience working directly with AI hardware and software development teams.
Ways to Stand Out From the Crowd:
Background with benchmarking accuracy and performance of AI models
Experience with CUDA programming and GPU performance optimizations
Published research or significant contributions to the field of AI, particularly in algorithm development for hardware-software co-design.
As NVIDIA makes inroads into the Datacenter business, our team plays a central role in getting the most out of our exponentially growing datacenter deployments as well as establishing a data-driven approach to hardware design and system software development. We collaborate with a broad cross section of teams at NVIDIA ranging from DL research teams to CUDA Kernel and DL Framework development teams, to Silicon Architecture Teams. As our team grows, and as we seek to identify and take advantage of long term opportunities, our skill-set needs are expanding as well.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!
The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are now looking for a Senior Deep Learning Performance Engineer!
NVIDIA is seeking highly skilled and dedicated engineers who are passionate about optimizing AI workloads, particularly Generative AI and Large Language Model (LLM) training. This role requires working across all levels of the hardware and software stack, including GPU architecture, compilers, kernels, and Deep Learning frameworks such as JAX and PyTorch. You'll have the opportunity to work on cutting-edge technology that will accelerate training performance for deep learning users all over the world. Are you ready to take on challenging problems and collaborate with innovative teams to engineer systems for high-performance Deep Learning? This could be the role for you!
What You Will Be Doing:
Understand, analyze, profile, and optimize large language model training on state-of-the-art hardware and software platforms.
Understand the big picture of LLM training performance on GPUs, prioritizing and then solving problems across all state-of-the-art LLM variations from research to industry.
Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL frameworks.
Implement and simulate key LLM workload behaviors in NVIDIA's proprietary processor and system simulators to enable future architecture studies.
Build tools to automate workload analysis, workload optimization, and other critical workflows.
What We Need To See:
PhD (or equivalent experience) in CS, EE or CSEE and 5+ years; or MS and 8+ years of relevant work experience.
Strong background in deep learning and neural networks, in particular training and large language models.
Deep understanding of computer architecture, and familiarity with the fundamentals of GPU architecture.
Expertise in analyzing and tuning application performance, preferably on GPUs.
Familiarity with common deep learning software packages like PyTorch and JAX.
Prior experience with processor and system-level performance modelling.
Programming skills in C++, Python, and CUDA.
Intelligent machines powered by AI computers that can learn, reason and interact with people are no longer science fiction. Today, a self-driving car powered by artificial intelligence can meander through a country road at night and find its way. An AI-powered robot can learn motor skills through trial and error. This is truly an extraordinary time. The era of AI has begun, and we are powering it. NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. Are you passionate about performance? Are you interested in working on industry-leading Deep Learning products? Come, join our Deep Learning Architecture team and help build real-time, cost-effective computing platforms driving our success in this exciting and rapidly growing field.
The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are now looking for a Senior Deep Learning Software Engineer, Inference! NVIDIA is seeking an experienced Deep Learning Engineer focused on analyzing and improving performance of DL inference! NVIDIA is rapidly growing our research and development for Deep Learning Inference and is seeking excellent Software Engineers at all levels of expertise to join our team. Companies around the world are using NVIDIA GPUs to power a revolution in deep learning, enabling breakthroughs in areas like LLM, Generative AI, Recommenders and Vision that has put DL into every software solution. Join the team that builds the software to enable the performance optimization, deployment and serving of these DL solutions. We specialize in developing GPU-accelerated Deep learning software like TensorRT, DL benchmarking software and performant solutions to deploy and serve these models.
Collaborate with the deep learning community to implement the latest algorithms for public release in TensorRT and DL benchmarks. Identify performance opportunities and optimize SoTA DL models across the spectrum of NVIDIA accelerators, from datacenter GPUs to edge SoCs. Implement optimizations using TensorRT, its open source tools like Polygraphy, TensorRT plugins, Triton and CUDA kernels. Work and collaborate with a diverse set of teams involving performance modeling, performance analysis, kernel development and inference software development.
What you'll be doing:
Performance optimization, analysis, and tuning of DL models in various domains like LLM, Recommender, GNN, Generative AI.
Scale performance of DL models across different architectures and types of NVIDIA accelerators.
Contribute features and code to NVIDIA’s inference benchmarking frameworks, TensorRT, Triton and LLM software solutions.
Work with cross-collaborative teams across generative AI, automotive, image understanding, and speech understanding to develop innovative solutions.
What we need to see:
Masters or PhD or equivalent experience in relevant field (Computer Engineering, Computer Science, EECS, AI).
At least 3 years of relevant software development experience.
You'll need excellent C/C++ programming and software design skills. SW Agile skills are helpful and Python experience is a plus.
Prior experience with training, deploying or optimizing the inference of DL models in production is a plus.
Prior experience with performance modelling, profiling, debug, and code optimization or architectural knowledge of CPU and GPU is a plus.
GPU programming experience (CUDA or OpenCL) is a plus.
GPU deep learning has provided the foundation for machines to learn, perceive, reason and solve problems posed using human language. The GPU started out as the engine for simulating human imagination, conjuring up the amazing virtual worlds of video games and Hollywood films. Now, NVIDIA's GPU runs deep learning algorithms, simulating human intelligence, and acts as the brain of computers, robots and self-driving cars that can perceive and understand the world. Just as human imagination and intelligence are linked, computer graphics and artificial intelligence come together in our architecture. Two modes of the human brain, two modes of the GPU. This may explain why NVIDIA GPUs are used broadly for deep learning, and NVIDIA is increasingly known as “the AI computing company.” Come, join our DL Architecture team, where you can help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.
The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are now looking for a Senior Deep Learning Software Engineer, DLSim! NVIDIA is seeking a Deep Learning Software Engineer to contribute to the creation of a compiler-oriented simulation infrastructure that swiftly assesses the forthcoming AI-accelerating GPU hardware and software advancements.
The DLSim Teams’ core mission is to deliver full-stack simulation infrastructure for deep learning applications across a spectrum of GPUs. We actively collaborate with architecture, software, product, and research teams to shape and refine the strategic roadmap of DL hardware and software.
What you’ll be doing:
Pioneer a novel deep learning compiler and simulation infrastructure that enables fast simulation of NVIDIA GPUs within DL compilers
Enhance DL compiler GPU kernel code generation with simulation-driven heuristics or cost models for optimal efficiency
Build a high-fidelity DL simulator with remarkable simulation speed
Partner across functional teams to understand and evaluate utilization, analyze, and identify opportunities for new or enhanced designs
Engage with Open Source communities
What we need to see:
A Masters or PhD degree in Computer Science, Computer Engineering, or a related STEM field (or equivalent experience)
3+ years of relevant work experience
Programming fluency in C/C++ and Python
In-depth understanding of compiler design and construction, particularly optimizations on code generation
Experience with architectural simulator design
Advanced knowledge of GPU and/or other AI accelerators
Effective communicator with solid prioritization skills, and a logical approach to problem-solving
Ways to stand out from the crowd:
A PhD in Computer Science and Engineering with a specialization in Computer Architecture, Compilers, or simulation techniques
Experience with the MLIR or LLVM compiler infrastructure and DL compilers such as OpenAI Triton and IREE
Made contributions to open-source projects such as MLIR or LLVM
Worked with the challenges of DL acceleration before, and ready to put your innovative ideas into motion and drive for success
Intelligent machines powered by AI computers that can learn, reason and interact with people are no longer science fiction. GPU-accelerated Deep Learning has provided the foundation for machines to learn, perceive, reason and solve problems. NVIDIA's GPUs run AI algorithms, simulating human intelligence, and act as the brains of computers, robots and self-driving cars that can perceive and understand the world. Increasingly known as “the AI computing company”, and widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package.
Are you creative, motivated, and love a challenge? If so, we want to hear from you! Come, join our DLSim team, where you can help build real-time, cost-effective computing platforms driving our success in this exciting and rapidly growing field!
The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are looking for a Software Test Development Architect in NVIDIA’s Deep Learning SWQA team.
The position is in NVIDIA Deep Learning Software Quality Assurance team that defines, develops test strategies to improve quality of NVIDIA‘s Deep Learning software and GPU Infrastructure for autonomous driving, healthcare, speech recognition, natural language processing, and a wide variety of other AI scenarios. This position collaborates with multiple AI product teams to develop new products; identify test gaps in test plans, improve test coverage, and improve our workflow processes for a diverse range of GPU computing platforms. You should grow with being in the critical path supporting developers working for billion-dollar business lines as well as intimately understanding the values of responsiveness, thoroughness, and teamwork. You should constantly develop and implement efficiency improvements across your domain. Join the team which is building software which will be used by the entire world!
What you’ll be doing:
Work closely with DL engineering teams to develop a keen understanding of DL QA goals, test strategies, and technical needs.
Collaborate with diverse inter-groups, including DL Researchers, Product, and engineering teams to identify gaps, and improve processing.
Lead bug lifecycle and co-work with QA test developers to analyze customer bugs and user scenarios to improve test coverages.
What we need to see:
MS/PhD (PhD preferred) in CS, EE, Math or closely related fields or equivalent experience.
10+ years of software development experience in DL/ML, DL Framework (Especially JAX and PyTorch), Neural Networks, DL Service deployment, user scenario analysis and SDKs.
Able to design test strategies for diverse DL products to optimize test plans and identify the most essential and risky use cases.
Excellent C/C++, Python programming skills; Strong written and oral communications skills in English.
Ways to stand out from the crowd:
Be familiar with deep neural network training, inference, optimization in typical Frameworks.
Experience software development with popular AI models (e.g., LLM models)
Background with GPU computing and parallel programming such as CUDA/OpenCL.
You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are looking for a Senior Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning Compiler team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in many areas, e.g. image classification, speech recognition, recommendation systems, large language models and generative AIs, etc. Join the team building the DLC which will be used by the entire deep learning community.
What you'll be doing:
In this role, you will be responsible for analyzing deep learning networks and developing compiler optimization algorithms. Collaborate with members of the deep learning software framework teams and the hardware architecture teams to accelerate the next generation of deep learning software. The scope of these efforts includes defining public APIs, performance tuning and analysis, crafting and implementing compiler and optimization techniques for neural networks, and other general software engineering work.
What we need to see:
Bachelors, Masters or Ph.D. in Computer Science, Computer Engineering, related field (or equivalent experience)
6+ years of relevant work or research experience in performance analysis and compiler optimizations.
Ability to work independently, define project goals and scope, and lead your own development effort.
Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.
Strong interpersonal skills are required along with the ability to work in a dynamic product-oriented team.
Ways to stand out from the crowd:
Knowledge of CPU and/or GPU architecture. CUDA or OpenCL programming experience.
Experience with the following technologies is a huge plus: XLA, TVM, MLIR, LLVM, deep learning models and algorithms, and deep learning framework design.
A history of mentoring junior engineers and interns is a bonus.
With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most brilliant and hardworking people in the world working with us and our product lines are growing fast in some of the hottest state of the art fields such as Virtual Reality, Artificial Intelligence, Deep Learning and Autonomous Vehicles.
The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
NVIDIA is at the forefront of redefining healthcare by harnessing the power of GPU computing and AI to redefine data analysis in fields such as genomics and personalized medicine. We are on the lookout for hardworking and committed individuals to join our mission in integrating AI driven genomic solutions into mainstream healthcare. As a Genomics Deep Learning Engineer, you will be part of an innovative team dedicated to the development of deep learning algorithms tailored for genomics applications. This role offers the outstanding chance to define, implement, productize, and deliver innovative computing software that will have a global impact on genomics community. If you are driven by solving complex computational challenges in the genomics field, this role is tailor-made for you!
What you'll be doing:
Develop and refine deep learning models and techniques for genomics analysis, including but not limited to DNA sequencing, variant calling, and model prediction.
Innovate deep learning-based methodologies to enhance and apply advanced Large Language Models (LLMs), Graph Neural Networks, Graph Transformer Networks, and construct extensive multi-modal models in genomics.
Design and implement machine learning techniques to tailor foundation models for downstream genomic specific tasks.
Generate and manage datasets for large-scale machine learning, focusing on learning from genomics specific applications.
Collaborate closely with product and hardware architecture teams to ensure flawless integration of research and development into NVIDIA products.
Work in tandem with engineering and AI research teams to employ the latest technologies for scalable and innovative genomics analysis.
What we need to see:
Bachelor's or Master's degree in Computer Science, Bioinformatics, or Computational Biology, or related field (or equivalent experience).
8+ years of relevant experience.
Proficiency in C/C++ and Python, with a strong grasp of software design and programming principles.
Background with Large Language Models (LLMs) and natural language processing (NLP), Generative AI and Foundation Models.
Strong proficiency with modern frameworks such as PyTorch and TensorFlow, Experience with Large scale inferencing.
Experience in building and implementing complex algorithms and data structures, with a focus on bioinformatics or genomics applications.
Deep understanding of computer system architecture, operating systems, and the challenges associated with large-scale genomic data analysis.
Ways to stand out from the crowd:
Ph. D. in Computer Science, Statistics, Bioinformatics, Computational Biology, or closely related fields.
Hands-on experience in using LLM, Graph Neural Network, Graph Transformer Network particularly those applied to genomics data.
Strong collaborative and interpersonal skills to effectively work and influence within a dynamic, technical environment.
Ability to decompose complex requirements into step by step tasks and reuse available solutions to implement most of those.
Join our team where you'll play a pivotal role in crafting and building real-time efficient AI/ML solutions that underpin our triumphs in the fast-paced multifaceted and quickly growing genomics field. You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/).
The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are looking for a Deep Learning Compiler Engineer. NVIDIA is hiring software engineers for its Deep Learning Compiler team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in many areas, e.g. image classification, speech recognition, recommendation systems, large language models and generative AIs, etc. Join the team building the DLC which will be used by the entire deep learning community.
What you'll be doing:
In this role, you will be responsible for analyzing deep learning networks and developing compiler optimization algorithms. You’ll collaborate with members of the deep learning software framework teams and the hardware architecture teams to accelerate the next generation of deep learning software. The scope of these efforts includes crafting & implementing compiler and optimization techniques for neural networks, performance tuning and analysis, defining public APIs and other general software engineering work.
What we need to see:
Bachelors, Masters or Ph.D. in Computer Science, Computer Engineering, related field (or equivalent experience)
5+ years of relevant work or research experience in performance analysis and compiler optimizations.
Ability to work independently, define project goals and scope, and lead your own development effort.
Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.
Strong interpersonal skills are required along with the ability to work in a dynamic product-oriented team.
Ways to stand out from the crowd:
Knowledge of CPU and/or GPU architecture. CUDA or OpenCL programming experience.
Experience with the following technologies is a huge plus: XLA, TVM, MLIR, LLVM, deep learning models and algorithms, and deep learning framework design.
A history of mentoring junior engineers and interns is a bonus.
With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most brilliant and hardworking people in the world working with us and our product lines are growing fast in some of the hottest state of the art fields such as Virtual Reality, Artificial Intelligence, Deep Learning and Autonomous Vehicles.
The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are now looking for a Senior Python Automation Engineer, for our Deep Learning Algorithms team! Join the team building software which will be used by the entire world of AI. Work with high class software engineers to implement a large scale toolset that tests deep learning models and frameworks on the most powerful computers. Ability to work in a multifaceted, fast-paced environment is required as well as strong social skills. In this role you will be interacting with internal partners, users, and members of the open source community to implement solutions for building, testing, integrating, and releasing of NVIDIA AI Services and Deep Learning Frameworks on the most powerful, enterprise-grade GPU clusters capable of hundreds of Peta FLOPS. This role spans multiple products such as PyTorch, TensorFlow, JAX, PaddlePaddle. You will work with internal engineering teams to deploy and operationalize AI models and services at scale by driving adoption for end-to-end Machine Learning and Deep Learning solutions in the cloud and on prem.
We are seeking passionate and hardworking python developers to help us scale our AI and deep learning services, platforms, models and internal tools. You will be responsible for implementing and maintaining tools, and infrastructure that enable our teams to productize NVIDIA SW stack: from DL Frameworks (PyTorch, TF, JAX, PaddlePaddle), DL models and AI services. Are you ready for this challenge?
What you’ll be doing:
Automating and optimizing testing of Deep Learning models and AI Services from different data domains with focus on inference
Developing shared utilities for setting up systems, running tests, recording results and visualization on dashboards.
Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. GitLab, Docker, Bash)
Lead best-practices for building, testing, and releasing software including AI Services and DL models
Identifying infrastructure needs and translating them into action
Building tools for automatic content generation mechanisms that saves dozens of engineering hours
What we need to see:
BSc or MS degree in Computer Science, Computer Architecture or related technical field or equivalent experience
5+ years of work experience in software development
Excellent Python programming skills, Great coding skills and a deep understanding of OOP concepts.
Familiarity with DevOps concepts such as CI/CD, Docker, Jenkins and automation tools.
Experience in building both front-end (e.g. JS, React, Vue, Dash, Streamlit) and back-end services (e.g Flask, FastAPI, Django) services
Understanding of Deep Learning allowing benchmarking on DL models
Willing to take action and strong analytical skills.
Strong time-management and organization skills for coordinating multiple initiatives, priorities and implementations of new technology and products into very complex projects.
Good communication and documentation habits
Ways to stand out from the crowd:
Experience with containerization technologies such as Docker
Experience in building monitoring or dashboarding solutions to support CI/CD pipelines.
Hands-on in configuring complex CI pipelines
Experience with HPC based compute clusters and scheduling solutions like Slurm. Solid understanding of Linux environments
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and forward-thinking people in the world working for us. If you're creative and autonomous, we want to hear from you!
The base salary range is 148,000 USD - 276,000 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are now looking for a Senior Deep Learning Software Engineer, for Algorithmic Model Optimization!
Join our team of algorithmic model optimization experts and take part in unlocking the biggest potential for AI with generative models such as large language models (LLM) and diffusion models. As a Senior Deep Learning Software Engineer, you will be at the forefront of pushing the boundaries of these models and enabling their deployment at a larger scale with unmatched efficiency. We are developing an innovative software platform that will not only be utilized internally but also have a significant impact externally by enabling the creation of groundbreaking AI products. This is an exceptional opportunity for passionate software engineers like you, who have a strong background in Deep Learning, to join us in solving the most significant challenges in the field.
Your role will be pivotal in our mission to maximize the potential of our rapidly expanding data center deployments. Additionally, you will play a crucial part in adopting a data-driven approach to hardware design and system software development. Collaboration is at the heart of what we do, and you will have the chance to work closely with a diverse range of teams at NVIDIA, including the Applied Deep Learning Research teams, CUDA Kernel and DL Framework development teams, and the Silicon Architecture Team. In this position, you will actively engage with internal stakeholders, users, and members of the open-source community. Your input will be vital in defining and implementing cutting-edge model optimization algorithms. The scope of your work will encompass researching and developing highly efficient search algorithms, defining public APIs, implementation, and various other software engineering tasks. We are seeking individuals who are as enthusiastic as we are about pushing the boundaries of AI and contributing to groundbreaking advancements in the field. If you are passionate about innovation, tackling complex DL problems, and working in a collaborative environment, this is the perfect opportunity for you. Join us, and together, we will shape the future of AI model optimization and its impact on the world.
Prototype and develop model optimization methods, and build a most impactful model optimization platform
Collaborate with internal and external partners to accelerate the adoption of deep learning model optimization
Stay up to date with the latest research and innovations in generative AI and model optimization techniques
Analyze and optimize the theoretical and practical performance of DL models generated
Publish findings in top AI conferences, and create Intellectual Property
Masters, PhD, or equivalent experience in Computer Science, AI, Applied Math, or related field.
10+ years of relevant work or research experience in Deep Learning.
Excellent software design skills, including debugging, performance analysis, and test design
Strong algorithms and programming fundamentals
Ability to work independently, define project goals and scope, and run your own development effort
Good communication, documentation habits, and interpersonal skills
Experience with one or more: Python, C++, performance tuning
Contributions to PyTorch, JAX, or other Machine Learning Frameworks
Knowledge of GPU architecture and compilation stack, and capability of understanding and debugging end-to-end performance
Familiarity with Nvidia’s deep learning SDK such as TensorRT
Strong understanding of deep learning algorithms and solutions
Strong understanding of ML model optimization techniques such as quantization, pruning, distillation.
Increasingly known as “the AI computing company” and widely considered to be one of the technology world’s most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Are you creative, motivated, and love a challenge? If so, we want to hear from you! Come, join our model optimization group, where you can help build real-time, cost-effective computing platforms driving our success in this exciting and rapidly growing field.
The base salary range is 220,000 USD - 419,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of all facets of Deep Learning such as training and inferencing, one of today's most important workloads in the world. If you are unafraid to work across all layers of the hardware/software stack from GPU architecture to Deep Learning Framework to achieve peak performance, we want to hear from you! This role offers an opportunity to directly impact the hardware and software roadmap in a fast-growing technology company that leads the AI revolution while helping deep learning users around the globe enjoy ever-higher training speeds.
What you'll be doing:
Understand, analyze, profile, and optimize deep learning workloads on state-of-the-art hardware and software platforms.
Build tools to automate workload analysis, workload optimization, and other critical workflows.
Collaborate with cross-functional teams to analyze and optimize cloud application performance on diverse GPU architectures.
Identify bottlenecks and inefficiencies in application code and propose optimizations to enhance GPU utilization.
Drive end-to-end platform optimization from a hardware level to the application and service levels
Design and implement performance benchmarks and testing methodologies to evaluate application performance.
Provide guidance and recommendations on optimizing cloud-native applications for speed, scalability, and resource efficiency.
Share knowledge and best practices with domain expert teams as they transition applications to distributed environments.
What we need to see:
Masters in CS, EE or CSEE or equivalent experience
8+ years of experience in application performance engineering
Experience using large scale multi node GPU infrastructure on premise or in CSPs
Background in deep learning model architectures and experience with Pytorch and large scale distributed training
Experience with application profiling tools such as NVIDIA NSight, Intel VTune etc.
Deep understanding of computer architecture, and familiarity with the fundamentals of GPU architecture. Experience with NVIDIA's Infrastructure and software stacks.
Proven experience analyzing, modeling and tuning DL application performance.
Proficiency in Python and C/C++ for analyzing and optimizing application code
Ways to stand out from the crowd:
Strong fundamentals in algorithms and GPU programming experience (CUDA or OpenCL)
Understanding of NVIDIA's server and software ecosystem
Hands-on experience in performance optimization and benchmarking on large-scale distributed systems
Hands-on experience with NVIDIA GPUs, HPC storage, networking, and cloud computing.
In-depth understanding storage systems, Linux file systems, RDMA networking
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you.
The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are looking for an outstanding and passionate research engineer who stands at the forefront of generative AI to join the Developer Tools organization at NVIDIA. We aim at shipping the best tools powered by AI to help software developers achieve the most productivity and SOL performance on NVIDIA platforms. Come be part of our team!
What You Will Be Doing:
Study and work on SOTA generative AI models, through collaborations with NVIDIA research teams, to innovate new approaches on adapting the models for down-stream tasks that are suitable for developer tools.
Lead the technical direction and strategize on the next-generation AI-powered solutions for generating, debugging, and optimizing accelerated computing code.
Work closely with platform and product teams to architect, design, and integrate AI-based features into existing NVIDIA developer tools products.
What We Need to See:
PhD in computer science, mathematics, or related engineering fields (or equivalent experience).
At least five years of relevant technical work experience.
Solid knowledge of both theory and practices of deep learning, generative AI, large language models, and transfer learning techniques.
Strong programming skills in Python, C++, or CUDA C/C++.
Working experience with DL frameworks such as PyTorch, Tensorflow, or JAX.
Outstanding track record of research publications or developing AI-based products.
Excellent communication and interpersonal skills, and ability to work successfully with geographically distributed and multi-functional teams.
Ways to Stand Out from the Crowd:
Hands-on experience with end-to-end development of generative AI solutions.
Deep understanding of how generative AI can be leveraged for coding tasks.
Familiar with NVIDIA GPU technology and DL software stacks (NeMo, TensorRT-LLM, etc.).
Expert knowledge in GPU architectures and GPU programming.
Experience and/or interests in building developer tools like debuggers and profilers.
NVIDIA is widely regarded as one of the most innovative technology companies in the industry, and we have some of the most forward-thinking and versatile people in the world. Are you creative and autonomous? Do you love a challenge and make a difference? If so, we want to hear from you.
The base salary range is 180,000 USD - 339,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are looking for Deep Learning Compiler Engineers. NVIDIA is hiring software engineers for its Deep Learning Compiler team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from image classification to speech recognition to natural language processing and artificial intelligence. Join the team which is building software which will be used by the entire deep learning community.
As a member of the Deep Learning Compiler Team, you will be responsible for developing compiler optimization algorithms for deep learning networks. You will be driving inference and training performance of JAX framework and XLA and OpenXLA compilers on NVIDIA GPUs at scale. You’ll collaborate with our partners in deep learning framework teams and our hardware architecture teams to accelerate the next generation of deep learning software.
What you'll be doing:
Crafting and implementing compiler optimization techniques for deep learning network graphs
Designing novel graph partitioning and tensor sharding techniques for distributed training and inference
Performance tuning and analysis
Code-generation for NVIDIA GPU backends using open-source compilers such as MLIR, LLVM and OpenAI Triton.
Defining APIs in JAX and related libraries and other general software engineering work
What we need to see:
Bachelors, Masters or Ph.D. in Computer Science, Computer Engineering, related field (or equivalent experience)
1+ years of relevant work or research experience in performance analysis and compiler optimizations.
Ability to work independently, define project goals and scope, and lead your own development effort adopting clean software engineering and testing practices.
Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.
Strong foundation in CPU and/or GPU architecture. Knowledge of high-performance computing and distributed programming. CUDA or OpenCL programming experience is desired but not required.
Experience with the following technologies is a huge plus: XLA, TVM, MLIR, LLVM, OpenAI Triton, deep learning models and algorithms, and deep learning framework design.
Strong interpersonal skills are required along with the ability to work in a dynamic product-oriented team. A history of mentoring junior engineers and interns is a bonus.
Ways to stand out from the crowd:
Worked on a deep learning framework such as JAX, Pytorch or Tensorflow.
Experience with CUDA or with GPUs
Proficient with open-source compilers such as LLVM and MLIR.
With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology industry's most desirable employers. We have some of the most brilliant and hardworking people in the world working with us and our product lines are growing fast in some of the hottest state of the art fields such as Virtual Reality, Artificial Intelligence, Deep Learning and Autonomous Vehicles.
The base salary range is 104,000 USD - 189,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
We are now looking for a Senior Performance Engineer focused on Deep Learning (DL) & High Performance Computing (HPC) to join our team. Our team is responsible for generating benchmark data across a rapidly growing catalog of Deep Learning & HPC applications on Nvidia and as well as competitive products. The data that we collect drives marketing and sales collateral as well as engineering studies for current and future products. In some instances, we write scripts that improve the team’s ability to collect data through automation and designing efficient processes for testing a wide variety of applications and hardware. You will have the opportunity to work with multi-functional teams and in a dynamic environment where multiple projects will be active at once and priorities may shift frequently. As a senior engineer, you will lead and guide the team through some of these projects.
NVIDIA builds the most advanced data center GPUs in world that are utilized in a growing number of computing areas ranging from life sciences to deep learning to quantum chemistry. NVIDIA strives to deliver the best possible performance, which allows researchers and scientists to do more world-changing work than ever before. Today, we are increasingly known as “the AI computing company.” We are looking to grow our company and build our teams with the smartest people in the world. Join us at the forefront of technological advancement.
What you’ll be doing:
Plan and execute GPU performance benchmarking across a wide range of HPC and DL frameworks and applications across on-prem, cluster, and cloud server platforms.
Aggregate, analyze, and generate written and visual reports with the testing data for internal sales, marketing, SW, and HW teams
Develop Python scripts to automate the benchmarking of test applications
Perform competitive analysis of GPU and CPU products
Work with internal engineering team to triage performance issues
Assist with the development of tools and processes that improve our ability to perform automated testing
Lead team in exploring and integrating new testing functions. eg LLMs, GenAI
Guide and mentor junior engineers
What we need to see:
Bachelor of Science degree in Engineering or Computer Science or equivalent experience.
8+ years of experience
Excellent programming and debugging skills in a scripting language such as Python or Unix shell
Advanced knowledge using Linux based systems
Proficiency in compiling software from source code, including debugging errors encountered
Knowledge of deep learning neural networks; how they work and familiarity with various DL frameworks
Experience using GPU-enabled HPC applications such as LAMMPS, GROMACS, Amber, RTM, etc…
Excellent English verbal and written interpersonal skills
Excellent data analysis skills and the ability to summarize findings in a written report
Familiarity using a container platform such as Docker or Singularity
Ways to stand out from the crowd:
Experience with GPU/CPU benchmarking on cloud solutions from AWS, GCP, Azure
GPU programming experience in CUDA, OpenACC, or OpenCL
Familiarity with software compilers such as GNU, Intel Composer, or PGI
Previous experience with benchmarking computer clusters
We have some of the most forward thinking and hardworking people in the world working for us and our best-in-class engineering teams are rapidly growing. We are building a team that will help shape the future of data center computing. If you are passionate about new technologies, care about improving efficiency and quality, and want to be at the forefront of AI & HPC, we would love for you to join us.
The base salary range is 164,000 USD - 310,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.
Official account of Jobstore.
A career in teaching is one of the most overlook job opportunities in the private sector. There are various fields in the education sector ranging from administration, curriculum design, student counselling, recreation, education policy, research, writing and mentoring.
The type of education jobs that can be found is principal, superintendents, academic officers, librarians, school teacher, school counselor, kindergarten teacher, language teacher, college professors and instructors.
The role of lecturer is to undertake teaching, research and administrative duties within a specialised subject area. Responsibilities include interviewing course applicants, arrange lecture with students, assessing students work, invigilating examinations, attend staff meetings and supervise students.
The role of teacher is to prepare and implement a full educational teaching plan according to the school’s requirements. Responsibilities include assess student progress, plan in-class activities and create educational contents. An experienced teacher will know how to organise a class and make learning a meaningful process.
The role of registrar is to maintain the academic records of all students and plan the registration process for classes. Responsibilities include determine student graduation eligibility, planning commencement activities and preparing honor lists as well as class rankings.
The role of special needs teacher is to develop special needs education according to students’ learning needs. Responsibilities include assessing children’s skills, design individualised educational plans and collaborate with parents to track students’ progress.
A trainer is responsible for identifying training needs and outlining the plans for teams and individuals. Training includes managing, designing, developing, coordinating and conducting all training programs. The type of positions that can be found are facilitator, operations, training coordinator/specialist, IT trainer, corporate trainer, assistant and personal training.