Fathom Radiant Logo Fathom Radiant
Fathom Radiant Logo

Software Engineer, ML Compiler and Framework

💰 $90,000 - $170,000 📅 02/09/2023

Apply

Architect, Network Modeling

💰 $150,000 - $230,000 📅 09/20/2023

Apply

Job Description

About us
We are searching for talented individuals who are driven to tackle the most
ambitious goal of our time - building the computer hardware that enables the
development of safe artificial general intelligence. See more at
fathomradiant.co/aboutus

In our people, we above all value kindness, a scout mindset, a focus on
improvement, and prioritising to get the right things done. We aim to help
build one of the most transformative technologies in the world, with massive
social and ethical implications. We think this makes representation even more
important, and we are actively striving to have a range of diverse
perspectives on our team.

This Role
As a Network Modeling Engineer, you will lead the modeling, analysis, and
optimization of highly parallel AI and high-performance computing
applications. In this role, you will be responsible for conducting network
simulations and workload characterizations. You will work closely with our ML
and Compiler Framework Software Engineer for end-to-end system analysis and
developing software frameworks for system-level performance modeling and
analysis.

This role is open to fully remote.

Areas of contribution:

* Conduct workload characterizations and end-to-end system analysis for highly parallel deployment of AI and HPC applications.
* Evaluate network topologies and compute utilization performance for large scale training clusters across parallelism strategies (data, model, and pipeline).
* Design, develop, and maintain software frameworks for system-level performance modeling and analysis, including simulation environments and analytical frameworks, evaluate benchmark performance and identify improvement opportunities.
* Collaborate with engineering and product teams to guide the development of new features, and drive the hardware-software co-design for improved performance and PPA.

Requirements (necessary skills for this role):

* PhD in Computer Science or Electrical Engineering
* Proficiency in object oriented programming languages, for example: C++, Rust or Java.
* Demonstrated experience in developing network simulation models
* Experience in architecting and optimizing networking protocols for congestion control, reliable transport, and co-flow completion time.
* Understanding of the computer architecture and microarchitecture concepts, and highly parallel computing paradigms.
* Experience in both assessing application-level performance and microbenchmarks
* Experience in optimizing algorithms and data-structures.
* Strong analytical, problem-solving, and communication skills.

Nice-to-haves (we will prioritize candidates that also have these skills):

* Understanding of AI/ML algorithms and parallelization techniques.
* Understanding of distributed systems collectives such as NCCL or OpenMPI
* Understanding of networking stacks and protocols, including Ethernet, Inifiniband and other proprietary Fabrics.

For all roles, we target market salaries, with an additional benefits package.
Our comprehensive benefits include startup equity, medical expenses coverage
(including extra coverage for employees with a family).

Fathom Radiant focuses on Hardware, Semiconductors, Cloud Computing, Machine
Learning, and Artificial Intelligence. Their company has offices in Boulder.
They have a small team that's between 11-50 employees.

You can view their website at <https://fathomradiant.co/> or find them on
[LinkedIn](https://www.linkedin.com/company/fathom-radiant/mycompany/).