Software Inference Deployment Engineer
Lumai —Oxford
- Full-time
- Hybrid work
- Master's degree
- Machine learning
- Linux
- Annual leave
Quick apply
8h
Full Stack Software Engineer (Cloud & Integrations)
BabelQuest —Oxford
- Hybrid work
- Azure
- AWS
- SharePoint
- Annual leave
1d
Software Engineer II
Tripadvisor —Oxford
- Java
- SQL
- AWS
- Employee assistance programme
- Employee discount
Quick apply
Senior Software Engineer (Full Stack)
Leidos —Ham
- £61,500 - £78,800 a year
- Full-time
- Java
- C++
- Scripting
- Annual leave
- Work from home
Cloud Architect
Roke —Woking
- Full-time
- Management
- DevOps
- AWS
Senior Software Engineer
IDBS —Woking
- Full-time
- Remote
- DevOps
- Java
- AWS
Intermediate Member of Technical staff, Embedded Linux Software Specialist / Engineer
MDA —Farnborough
- Full-time
- C++
- Driving
- Scripting
- Employee assistance programme
- Life insurance
Software Engineer, Data Infrastructure & Acquisition - Oxford, United Kingdom
Speechify —Oxford
- Remote
- Doctoral degree
- Computer Science
- Master's degree
3d
Field Application Engineer Motorsport - Software
Motion Applied —Woking
- Hybrid work
- Microsoft Excel
- SQL
- Bachelor's degree
- Annual leave
Quick apply
Senior Golang Software Engineer
InfoSum —Basingstoke
- Hybrid work
- Azure
- Java
- AWS
7d
Full Stack Software Engineer (UK)
Quantios —Fleet
- Full-time
- Computer Science
- Engineering
- Bachelor's degree
Quick apply
DevOps - Programmer
Computewell —Slough
- Oracle
- Computer Science
- Windows
Cloud Architect
Roke Manor Research Limited —Woking
- Management
- DevOps
- AWS
Full Stack Engineer
Centrica —Windsor
- Azure
- SQL
- Machine learning
13d
Early in Career Network Engineer (Site Reliability Engineer) - UK
Cisco Systems —Feltham
- Full-time
- Hybrid work
- Analysis skills
- Bachelor's degree
- APIs
Senior Embedded Linux Software Engineer
MDA Space —Farnborough
- C++
- Linux
- AI
- Employee assistance programme
- Life insurance
Software Engineer
Oxford Nanopore Technologies —Oxford
- Full-time
- AWS
- APIs
- Python
Lead Software Developer (AI Solutions)
PSI CRO —Oxford
- Full-time
- Remote
- Azure
- Master's degree
- SQL
Quick apply
Embedded Software Engineer (UAVs)
Archangel Autonomy —Oxford
- Full-time
- Computer Science
- Engineering
- Master's degree
Quick apply
Senior Scientist CADD Antibody Design Developer
UCB —Slough
- Doctoral degree
- English
- Machine learning

I want to receive the latest job alerts for python developer in reading, eng

By signing in to your account, you agree to SimplyHired's Terms of Service and consent to our Cookie and Privacy Policy.

Explore jobs in more locations

python developer jobs near reading, eng

Software Inference Deployment Engineer

Lumai -
Oxford

Quick apply

Job details

Full-time
8 hours ago

Benefits

Cycle to work scheme
Annual leave
Company pension
Private medical insurance

Qualifications

Law
PyTorch
FPGA
Software deployment
Master's degree
Docker
Machine learning
Linux
AI
Python

Full job description

The Opportunity

Lumai is redefining how the world computes. We are an ambitious, venture-backed UK startup pioneering a breakthrough AI accelerator for data centers which uses 3D optical compute. Our radical technology uses light to perform computation at orders of magnitude faster speeds and at far greater scales than ever before, all whilst consuming far less energy than traditional approaches.

Lumai is unlocking performance and efficiency gains that could transform the economics of AI and compute infrastructure and reshape how intelligence scales globally.

If you are passionate about bringing groundbreaking technology to market, and want to be part of a team pushing the boundaries of what is physically possible, Lumai is where you can make it happen.

About Lumai

Founded in 2022, Lumai is a University of Oxford spinout using optical processing to accelerate large language models (LLMs) and other transformer-based AI systems. The team combines expertise in optical computing, machine learning, and physics.

Lumai has already secured over $15 million in investment from leading deep-tech investors like Constructor Capital, IP Group, PhotonVentures and government grants, and is scaling rapidly to deploy the fastest optical compute currently available globally.

The Role

We are bringing the world's first optical AI compute platform to market. As we move from development into field deployment, we are looking for a Software Inference Deployment Engineer to own the software-side integration and customer support of Lumai Iris servers in third-party data centre environments.

You will begin by working alongside our software and engineering teams - helping integrate the Iris software stack, supporting model onboarding through the toolchain, and getting hands-on with the disaggregated prefill/decode runtime. This is intentional: the best way to develop deep expertise in a novel platform is to build with it. As deployments go live, you will take ownership in the field - supporting customer integration into their inference stacks, troubleshooting software issues, and acting as a primary technical contact for customer ML and infrastructure engineering teams.

This is an opportunity to work at the cutting edge of efficient AI inference - deploying a genuinely novel compute platform into production for the first time, and playing a central role in how it reaches the world.

What You'll Do

Work alongside Lumai's software and engineering teams to integrate, test, and harden the Iris software stack ahead of deployment
Support model onboarding through the Iris toolchain - loading, conversion, and framework integration
Develop hands-on familiarity with the disaggregated prefill/decode runtime, including how Iris servers operate alongside decode processors
Support customer integration of Lumai Iris into their own frameworks
Own software-side troubleshooting in the field, acting as the first line of response post-deployment
Train and enable customer ML and infrastructure engineering teams on the Iris software platform
Feed field findings, integration issues, and customer feedback back into product and engineering

What We're Looking For

Must-Have

Hands-on software engineering experience in AI infrastructure, inference serving, accelerator integration, or comparable deep-tech hardware-software environments
Strong Python skills and familiarity with major ML frameworks (PyTorch in particular)
Practical experience with model deployment workflows - loading, format conversion, quantisation, or framework integration
Comfortable working with inference serving stacks (for example vLLM, TensorRT-LLM, or similar)
Familiarity with Linux, containerisation (Docker), and cluster environments
Comfortable in a customer-facing role, able to communicate clearly with ML and infrastructure engineering teams
Comfortable working in a fast-moving, early-stage environment where the product and the deployment approach are both still being developed

Strong Preference For

Experience integrating accelerator hardware (GPUs, FPGAs, ASICs, NPUs, or novel architectures) into customer inference workflows
Familiarity with the NVIDIA inference stack - CUDA, TensorRT, Triton
Exposure to disaggregated inference architectures, prefill/decode separation, or KV cache management

Compensation & Benefits

Highly Competitive Salary: We are not saying our salary is a blank check, but let's just say it won't be a source of your stress
Share Option Scheme: We are all in this together! We believe in shared success while we build the Lumai of tomorrow
Pension Scheme: Plan for retirement with AVIVA
Private Health Insurance: We firmly believe that you come first, and a happy you is a healthy you! Look after yourself and your loved ones with AXA
Cycle to Work: Spread the cost of a bike, a bike and accessories or just accessories and save on tax
L&D Allowance: Stay at the forefront of your field with a £500 annual development budget
Subsidised On-site Lunches: Enjoy on-site healthy meals at half the price, as Lumai covers 50% of the cost
Holidays: Enjoy some deserved "me time" with 25 days paid holiday (plus bank holidays) per year
Socials: Be part of an inclusive community enjoying occasional all-company off-sites, lunches and socials

Interview Process

Our process is four stages. An initial conversation with our HR team to understand what you want from the role and what we want from it. Two technical sessions with our Product and Leadership team. Finally, an HR-team session covering scope, terms, and any final questions. We aim to move fast on candidates we are excited about; expect roughly three to four weeks end to end.

Lumai is an equal opportunity employer. We make hiring decisions on merit, scope-fit, and the strength of the working relationship we expect to build with each hire. Applications welcome from candidates of any background. If you are not sure whether you are a fit, send a note anyway.

Quick apply

Refine Your Search

python developer jobs in reading, eng

Software Inference Deployment Engineer

Full Stack Software Engineer (Cloud & Integrations)

Software Engineer II

Senior Software Engineer (Full Stack)

Cloud Architect

Senior Software Engineer

Intermediate Member of Technical staff, Embedded Linux Software Specialist / Engineer

Software Engineer, Data Infrastructure & Acquisition - Oxford, United Kingdom

Field Application Engineer Motorsport - Software

Senior Golang Software Engineer

Full Stack Software Engineer (UK)

DevOps - Programmer

Cloud Architect

Full Stack Engineer

Early in Career Network Engineer (Site Reliability Engineer) - UK

Senior Embedded Linux Software Engineer

Software Engineer

Lead Software Developer (AI Solutions)

Embedded Software Engineer (UAVs)

Senior Scientist CADD Antibody Design Developer

I want to receive the latest job alerts for python developer in reading, eng

Related Searches

Explore jobs in more locations

The Opportunity

About Lumai

The Role

What You'll Do

What We're Looking For

Compensation & Benefits

Interview Process

Jobseeker tools

Employer Tools

Browse

Stay Connected