Preferred Networks is hiring a
MN-Core LLM Serving Engine Engineer

About Preferred Networks

Preferred Networks is an AI company based in Tokyo working across the stack, from AI chips and computing infrastructure to LLMs and products. You may already know us indirectly if you\'ve used software we\'ve built, such as Optuna or CuPy (or Chainer, back in the day). We are designing in-house chips (MN-Core series) and training LLMs (PLaMo series). Our team is actively hiring for two roles related to these endeavors.

Job Description

Build software infrastructure to serve LLMs using our upcoming MN-Core L1000 inference accelerator. Work on the serving stack, API integration, and optimizing latency and throughput for production LLM workloads. Collaborate with ML researchers and hardware teams to deploy models in a scalable, maintainable way.

Remote

Remote Conditions

Remote within Japan; relocation to Japan required; visa and relocation support provided.

Salary

Not Specified

Benefits

Visa and relocation support

Tech Tags

Apis C##Distributed Systems Linux Open Source Python Vllm

Date Listed

02 May, 2026 (about 2 months ago)

Share this job

X / Twitter LinkedIn

Preferred Networks has 1 other job listed

Hiring engineers?

Reach thousands of tech candidates from the Hacker News community.

Post a Job — $99

Similar Jobs

Include this companyOnly other companies

Preferred Networks is hiring aMN-Core LLM Serving Engine Engineer