Salary: $100,000 - $150,000
Location: United States of America
Posted on: 07/05/2023
Apply##### Job Description :
Lambda's GPU cloud is used by deep learning engineers at Stanford, Berkeley,
and MIT. Lambda's on-prem systems power research and engineering at Intel,
Microsoft, Kaiser Permanente, major universities, and the Department of
Defense.
If you'd like to build the world's best deep learning cloud, join us.
##### **What You’ll Do:**
* Robustly scale Lambda’s compute, networking, and storage infrastructure across multiple environments
* Participate in an on-call rotation and write runbooks for common systems level failures
* Automate manual workflows by integrating new and existing pieces of infrastructure
* Research, evaluate, deploy, and maintain new infrastructure
* Work both independently and collaboratively on projects spanning all aspects of infrastructure
* Regularly contribute to documentation of Infrastructure systems and processes
* Have experience working with Linux systems, and are comfortable on the command-line
* Have excellent troubleshooting skills, and can work with others to identify and resolve issues
* Are comfortable scripting in bash or python
* Have experience working with configuration management tools such as Ansible or Saltstack
##### **Nice to Have:**
* Experience in the machine learning or computer hardware industry
* Experience with virtualization or containerization technologies, such as qemu/kvm, libvirt, openstack, lxd, or docker
* Experience with event-driven configuration management solutions
* Experience working with server hardware
* Foundational knowledge about networking
* Experience packaging or distributing tools for linux systems
* Familiarity with one or more other public clouds
* Software engineering experience
##### **About Lambda:**
* We offer generous cash & equity compensation
* Investors include Gradient Ventures, Google’s AI-focused venture fund
* We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability
* Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
* We have a wildly talented team of 100, and growing fast
* Our remote workforce, based on role, is across the U.S., with headquarters in San Francisco
* Health, dental, and vision coverage for you and your dependents
* Commuter/Work from home stipends
* 401k Plan
* Flexible Paid Time Off Plan that we all actually use