Systems Development Engineer III, Annapurna Labs Infrastructure
Company: Annapurna Labs (U.S.) Inc.
Location: Pflugerville
Posted on: May 8, 2024
|
|
Job Description:
Annapurna Labs, our organization within AWS, is responsible for
building innovation in silicon and software for AWS customers. With
development centers in the U.S. and Israel, Annapurna is at the
forefront of innovation by combining cloud scale with the world's
most talented engineers. The Annapurna team covers multiple
disciplines including silicon engineering, hardware design and
verification, software, and operations. Because of Annapurna's
breadth of talent, we've been able to improve AWS cloud
infrastructure in networking and security with products such as AWS
Nitro, Enhanced Network Adapter (ENA), and Elastic Fabric Adapter
(EFA), in compute with AWS Graviton and F1 EC2 Instances, in
machine learning with AWS Neuron, Inferentia and Trainium ML
Accelerators, and in storage with scalable NVMe.As part of
Annapurna Labs Infrastructure team, you'll have the opportunity to
invent the next generation of cloud computing infrastructure.
You'll experience what it's like to work in a fast-paced,
innovative, and start-up like environment filled with some of the
brightest minds in the industry. The work we do is not only
cutting-edge and internet-scale but also deeply important to our
customers. The team's infrastructure is used to design and build
every component of our hardware and software to come together into
products that our customers use for accelerated computing: either
Machine Learning acceleration, or FPGA acceleration. If you want a
career that makes an impact, allows you to invent, and have
first-hand visibility into how your implementations delight
customers, then we have a role for you. If you're interested in
being on a team that is "building a complete product" from
inception to delighted customers, Annapurna is a fantastic
choice.Join us in creating the most advanced Machine Learning
Accelerators in the world!Key job responsibilitiesAs a technical
leader of the Cloud-Scale Machine Learning Acceleration
Infrastructure team you'll be responsible for architecting and
leading development of the infrastructure used by our engineering
teams. Our customers, the engineering teams, building
hardware/software running in our data centers which are custom
designed machine learning products: AWS Inferentia2 and
Trainium.You will need to lead across teams to develop and execute
in-depth infrastructure development plans that enables the
engineering development of the Machine Learning Acceleration
product family. You will dive deep to solve critical infrastructure
issues involving networking, high performance compute clusters,
infrastructure automation of hardware/software/firmware testing,
and ASIC/EDA development. You will execute and scale the next
generation of cloud infrastructure based on cloud frameworks and
AWS services. You will own design reviews for infrastructure
development and partner with AWS service teams and vendors. You
will influence within your team, your customers and AWS service
teams to help drive and develop the technical implementation for
overall system designs. You will identify and implement process
improvements which improve your team's agility and operations,
including improvements to design, automation, development, test or
operations. You will define new mechanisms that execute system
health monitoring, diagnostics, repair, and automation. You will
develop, document and update operational runbooks as you
participate in on-call rotations. A day in the lifeEach day you
will work with the best engineers in the industry to develop
Machine Learning Accelerators. On-site in Austin, Texas, you will
be apart of the team that develops custom silicon and you will own
the infrastructure that enables this innovation. Take a look inside
our labs to see what you will learn at Annapurna Labs:
https://www.aboutamazon.com/news/aws/take-a-look-inside-the-lab-where-aws-makes-custom-chipshttps://youtu.be/rViVFrQg4HkWe
are open to hiring candidates to work out of one of the following
locations:Austin, TX, USA
BASIC QUALIFICATIONS- 5+ years of programming with at least one
modern language such as C++, C#, Java, Python, Golang, PowerShell,
Ruby experience- 3+ years of non-internship professional software
development experience- 5+ years of designing or architecting
(design patterns, reliability and scaling) of new and existing
systems experience- 5+ years of deploying and operating in a
Linux/Unix environment experience- 3+ years of systems design,
software development, operations, automation, and process
improvement experience- Experience leading the design, build and
deployment of complex and performant (reliable and scalable)
software solutions in production- 3+ years of systems development
in an IT or data center environment experience- Experience with
debugging complex issues with HW/SW, networking and storage
systems- Experience with operations of large scale infrastructure
deployments including improving operational excellence
PREFERRED QUALIFICATIONS- Knowledge of engineering practices and
patterns for the full software/hardware/networks development life
cycle, including coding standards, code reviews, source control
management, build processes, testing, certification, and livesite
operations- Experience taking a leading role in building complex
software or computing infrastructure that has been successfully
delivered to customers- Experience writing technical documents,
project plans and progress reports to leadership and to
stakeholders- Experience with AWS Cloud Infrastructure deployments
using CDK- Experience with IT security
software/tools/standardsAmazon is committed to a diverse and
inclusive workplace. Amazon is an equal opportunity employer and
does not discriminate on the basis of race, national origin,
gender, gender identity, sexual orientation, protected veteran
status, disability, age, or other legally protected status. For
individuals with disabilities who would like to request an
accommodation, please visit
https://www.amazon.jobs/en/disability/us.
Keywords: Annapurna Labs (U.S.) Inc., Georgetown , Systems Development Engineer III, Annapurna Labs Infrastructure, Healthcare , Pflugerville, Texas
Click
here to apply!
|