Research Engineer - CUDA



Posted on Wednesday, February 7, 2024
At IBM, work is more than a job – it’s a calling: To build. To design. To code. To consult. To think along with clients and sell. To make markets. To invent. To collaborate. Not just to do something better, but to attempt things you’ve never thought possible. Are you ready to lead in this new era of technology and solve some of the world’s most challenging problems? If so, let’s talk.

Your Role and Responsibilities
The MIT-IBM Watson AI Lab is seeking outstanding engineers to join a team of researchers and engineers developing cutting-edge techniques and algorithms to improve the efficiency of Large Language Models (LLMs) running in the cloud or on edge devices. Responsibilities include developing new algorithms and model architectures, conducting experiments to test hypotheses, developing solutions in CUDA, and working with other research and product teams on integration.

Required Technical and Professional Expertise
  • Background in computer science
  • Strong programming skills
  • Hands-on experience writing CUDA kernels
  • Experience with developing, training, and testing deep neural networks

Preferred Technical and Professional Expertise

  • Experience with LLMs or multi-modal foundation models
  • Experience with machine learning tools and frameworks such as TensorFlow and PyTorch
  • A track record of clearly and effectively communicating research ideas via publications and presentations at top-tier AI conferences (NeurIPS, ICLR, ICML, CVPR, ACL, etc.)