Sign up
Which kinds of involvement are you interested in?
What's your preference for remote vs on-site?
Which technologies are you interested in?
Language
Backend
Frontend
Platform
Infrastructure
Other

42 matching jobs

  • PyTorch
  • Machine Learning
  • Full-time

Senior Machine Learning Engineer – LLM & Reinforcement Learning (m/f/d) €160.000+ | Full-Time | ONSITE

As a Senior Machine Learning Engineer at warmwind your work will drive our next-gen AI models, shaping the future of machine intelligence beyond traditional paradigms. You'll work with massive compute clusters (500+ H100 GPUs) and cutting-edge reinforcement learning techniques to create highly efficient, scalable, and groundbreaking AI systems.

Responsibilities: Design, train, and optimize Large Language Models (LLMs) from scratch Scale distributed training on massive GPU clusters (500+ H100 GPUs) Implement advanced reinforcement learning techniques (RLHF, adversarial self-play, real-time control) Develop high-performance architectures for multi-modal AI systems Build simulation environments for RL-based AI agents Optimize inference speed and efficiency for real-world deployment

Your Profile: Deep expertise in LLMs – you’ve built and trained large-scale models yourself Experience with large-scale distributed training on 500+ GPU superclusters Deep understanding of reinforcement learning, neural network optimization, and self-play methods Expert in PyTorch, TensorFlow, JAX & low-level optimization techniques (CUDA, Triton, DeepSpeed, etc.) Familiarity with high-performance computing (HPC, NVLink, InfiniBand, parallel computing) Strong publication track record in AI/ML research is a plus Relocation to Jena, Germany after initial onboarding

Language requirements: German or English

https://warmwind.com/careers/senior-machine-learning-enginee...

  • C
  • C++
  • Java
  • Python
  • TypeScript
  • React
  • Linux
  • Windows
  • Jenkins
  • Machine Learning

Stellar Science | Hybrid (USA) Albuquerque NM, Washington DC (Tysons VA), Dayton OH | Full time, interns/co-ops | U.S. citizenship required | https://www.stellarscience.com

Company: We're a small scientific software development company that develops custom scientific and engineering analysis applications in domains including: space situational awareness (monitoring the locations, health and status of on-orbit satellites), image simulation, high power microwave systems, modeling and simulation, laser systems modeling, AI/ML including physics-informed neural networks (PINN), human body thermoregulation, computer vision and image processing, high performance computing (HPC), computer aided design (CAD), and more. All exciting applications and no CRUD. We emphasize high quality code and lightweight processes that free software engineers to be productive.

Experience: Except for interns, we currently require a Bachelors degree in physics, engineering, math, computer science, or a related field. Masters or PhD is a plus. (Roughly 25% of our staff have PhDs.)

Technologies: Lots of C++23, Qt 6.9, CMake, git, OpenGL, CUDA, Boost, Jenkins. Windows and Linux, msvc/gcc/clang/clangcl. AI/ML and analysis projects use Python and C++. Web projects use Java and Typescript/React.

Apply online: at https://www.stellarscience.com/careers/.

  • Python
  • Node.js
  • PostgreSQL
  • Redis
  • Docker
  • Kubernetes
  • Machine Learning

krea.ai | Senior Backend Engineer | San Francisco, CA | ONSITE | https://www.krea.ai

krea does AI research & builds AI tools for image generation, video generation, node-based workflows, LoRA training, and more. Small, mostly in-person team with a view of Alcatraz from the office window. Our users range from hobbyists all the way to professional designers at Apple or architects at firms behind The World Trade Center or Burj Khalifa.

We're looking for senior backend engineers. You'd work across our SvelteKit app (Postgres, Redis, Docker, ClickHouse), Python ML inference on GPU clusters, and k8s clusters across multiple cloud and GPU providers.

Some recent projects:

- building canary deploys with cookie-sticky traffic splitting - implementing durable execution for long-running workflows - designing our public API with OpenAPI docs auto-generated from Zod schemas - implementing enterprise-grade authentication, authorization, and permissions - optimizing ML inference for our hosted image generation models

We care way more about first-principles and core engineering skills rather than specific shenanigans around programming languages or particular tooling—knowing a lot about old UNIX principles is a plus though.

You should be comfortable owning things end-to-end. Experience with GPU infra is a plus. Many of us have some kind of creative background, it helps when building tools for creatives but is not a requirement by any means.

To apply, email d+hn@krea.ai (use the +hn suffix to make sure your email is prioritized!)

Posted 13 days ago by dvrp

  • Machine Learning

Espresso AI | Staff Engineers | NYC ONSITE | Full Tim

We use ML to make data warehouses and spark jobs more efficient. We're hiring staff ML engineers to train models that can understand how much compute a job needs, how it scales to larger machines, whether a machine can run more jobs, and so on; and staff infra engineers to take those models and deploy them on real-world production systems.

If this sounds cool, please email me an intro and a resume: ben [at] espresso [dot] ai

  • Python
  • PyTorch
  • Machine Learning
  • Full-time

Immunera | Principal Machine Learning Engineer | NYC area (hybrid) | Full-time | immunera.ai

Hey! I’m Maxim, cofounder/CEO of Immunera.

We're building blood tests that use gene sequencing and machine learning to help patients with autoimmune disease. Immunera spun out of years of research at Stanford, and our technology has been published in Science and covered by The New York Times.

We're hiring a Principal Machine Learning Engineer to shape the core models and infrastructure that power our blood tests. We are partnering with hospitals to generate training data from real patients for our language models and other biological sequence models.

This is a senior engineering role with significant autonomy at an early-stage, venture-backed startup. We're looking for strong experience building and operating ML systems in production (Python, PyTorch/TensorFlow, cloud platforms). Prior biology or healthcare experience is NOT required; we care much more about your ability to reason about data, models, and systems.

Full JD: https://www.immunera.ai/jobs

Contact: maxim@immunera.ai (subject: “HN ML job”)

  • AWS
  • GCP
  • Azure
  • Docker
  • Kubernetes
  • Machine Learning

Hey HN — Adobe is hiring an AI Agent Engineer. My team is building Adobe’s next‑gen, AI‑first marketing platform, and we’re looking for an engineer to design and ship production‑grade AI agents at enterprise scale. You’ll work on:

Building and orchestrating AI agents (LangChain, AutoGen, Semantic Kernel) Prompting, memory, tool‑calling, and agent‑to‑agent workflows Improving quality, reliability, and learning loops (HITL, retraining, drift) Integrating agents with enterprise data, compliance, and workflows

You should have:

5+ years in AI/ML, NLP, or backend engineering Strong experience with LLMs, RAG, and conversational systems in prod Comfort with cloud (AWS/GCP/Azure), APIs, Docker/K8s Bonus: multi‑agent systems, data labeling, synthetic data

If you’re excited about building high‑impact AI platforms at scale, I’d love to chat. Email abhishak@adobe.com with "HN" in the subject or apply here "https://adobe.wd5.myworkdayjobs.com/external_experienced/job..."

  • Python
  • TypeScript
  • GraphQL
  • PostgreSQL
  • React
  • Docker
  • Kubernetes
  • Terraform
  • Machine Learning

akeno | Senior/Staff Fullstack Engineer, DevOps Engineer, Applied Research Engineer | Full Time | ONSITE (hybrid) | Hamburg, Germany

At akeno, we help chemical manufactures optimise the production planning in their giant factories. Think asset utilisation, giant dependency graphs, adapting to real-time data, critical processes, a highly complex domain, and scenarios where even squeezing out 1% of extra utilisation can easily save our customers millions.

We're currently looking for 1x Senior/Staff Fullstack Engineer (Typescript, PostgreSQL, React, GraphQL/Hasura, TailwindCSS, Docker), 1x Senior/Staff DevOps Engineer (Kubernetes, Terraform, Observability, multi-environment management, …), and 1x Senior Applied Research Engineer (Python, ML/AI, optimisation) to extend our teams.

You'll need to bring real world experience from designing and running systems that really matter, a true "full stack" mentality (from UX deep down into the DB), team-spirit, and have worked in the EU before. We'll provide: Colleagues who are not only highly competent but you'll actually enjoy working with, 3-2 onsite/home-office model, great office in the heart of beautiful Hamburg, latest Macbook, excellent e2e/integration testing setup, pseudomised customer DB dumps for realistic development data, completely cloud-agnostic, "airplane mode" (you can spin up a full dev environment locally and without internet!), Open Telemetry, and more.

For more info: https://www.akeno.ai/careers and https://www.akeno.ai/tech-radar

  • Python
  • Rust
  • TypeScript
  • Android
  • AWS
  • iOS
  • Kubernetes
  • Terraform
  • Machine Learning

Radar Labs | Software Engineers (SRE, data platform, backend, full-stack, mobile) | Remote (US), NYC | Full Time | https://radar.com

- Radar is the geo-location dev tool

- Doing 1B+ API calls per day

- Our main languages are Rust and TypeScript, we also use mobile and offline pipeline languages (Python, Scala, and Terraform).

- We're based in NYC with our HQ in Union Square and remote friendly (US)

Interesting things we're working on:

- HorizonDB, our Geospatial database written in Rust

- Precise indoor location more accurate than iOS and Android leveraging Ultra-Wideband, other mobile sensors and ML.

- Extracting raw map data from satellite maps and the web leveraging ML and AI

- Anomaly detection to identify spoofed locations

- Mobile infrastructure that automatically configures itself optimizing battery-life and location accuracy for different use-cases over time

- Multi-Region AWS K8s deployment, 99.99%+ availability

- Frontend tools to visualize and debug location data at scale

Check out our jobs page here: https://radar.com/jobs#jobs If you have any questions, feel free to reply here or you can e-mail me at tim@radar.com

  • Python
  • Machine Learning
  • Full-time

Cradle (https://cradle.bio) | Scientific Software Engineer | Amsterdam | Onsite | Full-time

We're an AI platform for protein engineering (customers include Novo Nordisk, J&J, Grifols). We run our own wet lab in Amsterdam to generate training data for our ML models and to build the automation playbook we share openly with the field. We're hiring a software engineer to work embedded with our bioengineering team. You'd build things like: computer vision pipelines for colony picking, interfaces for lab automation workcells, integrations between lab instruments and Benchling. Python, APIs, databases, some frontend.

No PhD or biotech background needed. You'll pick up the biology. We care about solid engineering skills and curiosity about working at the intersection of software and hardware.

Apply: https://jobs.ashbyhq.com/cradlebio/7a4f7d2d-714e-4c01-84bb-9...

Posted 13 days ago by ros86

  • Machine Learning

Horison.ai | Applied Research Engineer + ML/AI Engineer | London, UK (Onsite/Hybrid)

Horison is building the intelligence substrate for private capital - not another RAG wrapper, but a foundational platform for how organisations structure, contextualise, and reason over proprietary information.

We're a funded, early-stage team of second-time founders, with backgrounds across fintech infrastructure, ML, and knowledge representation.

As Applied Research Engineer, you'll own the depth of our knowledge and retrieval infrastructure - ontology design, hybrid retrieval (dense/sparse/structured), information extraction, and knowledge quality systems. You'll translate research into production and shape the technical direction of a core system from day one.

You should have research experience in IR, NLP, or knowledge representation, and you should have shipped something real. Team of six, high ownership, no fluff.

anish [at] horison [dot] ai

  • Machine Learning
  • Full-time

VLM Run (https://vlm.run) | 1x Infrastructure Engineer + 2x AI/ML Engineer | Santa Clara, CA (HQ)

VLM Run is building infrastructure for production Vision-Language Model (VLM) systems — fast inference, tool-use + orchestration, reliable structured outputs, and the observability to iterate quickly. We’re a deeply technical team of veteran AI / computer-vision engineers (20+ years combined, MIT/CMU PhDs) who’ve shipped production ML infrastructure across autonomous driving and LLMs.

Open roles:

1. Infrastructure Engineer (Full-time, ONSITE): $150K–$220K + 0.5–3% equity https://app.dover.com/apply/VLM%20Run/8d4fa3b1-5b38-42e1-927...

2. AI/ML Engineer (Full-time, ONSITE): $150K–$220K + 0.5–3% equity https://app.dover.com/apply/VLM%20Run/1a490851-1ea1-4f12-a0f...

Email hiring "at" vlm.run with your GitHub + a couple recent projects.

P.S. We recently launched Orion, our visual agent that can reason and act over images, videos and documents. You can chat with Orion at https://chat.vlm.run and see capabilities at https://docs.vlm.run.

Apply: https://app.dover.com/jobs/vlm-run

  • Machine Learning

Hey HN, this is Louis, co-founder at SightlineOS.

Yusha, Derrick, and I are building an AI-native supply chain operating system for restaurant chains. It’s a massive, overlooked space with deeply underutilized data, and we’re using modern data and ML systems to automate planning, inventory management, and spend optimization. We’re venture-backed, NYC-based, and are hiring our first senior engineers as we start to scale. These are high-ownership roles with real production impact from day one.

You’d work closely with founders, own core systems end to end, and help shape the technical foundation of the company. You’ll own and scale the data foundations that power our ML and customer insights along with client implementation.

Location: NYC (hybrid, 3 days/week in office) Salary: $135,000 - $185,000 + equity

Apply here: https://www.sightlineos.com/sr-data-engineer

  • Machine Learning

Constellation Finance | Multiple Engineering Roles | New York, NY | Hybrid | Full Time

At Constellation, we are building a future where technology is the lethal advantage in credit investing.

Credit documents govern everything from how much debt a company can incur, or pay in dividends, to what they can do with proceeds from selling assets. Hidden debt capacity, loosely drafted provisions, or ambiguous definitions can materially alter recovery, making precise covenant interpretation essential to investing outcomes.

Constellation makes it easy to read credit documents, calculate covenant capacity, and plan liability-management transactions, reducing the mechanical bottlenecks to analyzing credits and generating alpha.

Our team is comprised of subject matter experts who have spent a decade in credit (Goldman Sachs/Citadel), AI/ML researchers, and full-stack engineers who are dedicated to solving frontier problems in their application to the world's largest asset class.

We recently raised a $3.25mm round led by Haystack and Company Ventures. We are hiring for these roles: https://www.constellationfinance.ai/#/careers

  • Java
  • Kotlin
  • Python
  • TypeScript
  • React
  • Machine Learning
  • Full-time

Starbridge | Senior Engineers (Kotlin/Java/React/Typescript) | NYC or Remote | Full-time | starbridge.ai

Starbridge is building an AI platform that turns large-scale public and enterprise data into reliable sales insights. We are early, moving fast, and building from zero to one, so this role will have huge ownership and product impact.

Product Engineer: (React/Typescript) who would work closely with product and design to build user-facing parts of the platform. You will craft performant, stable frontends that explain technical concepts to non-technical users and help us iterate fast based on customer feedback.

Backend Engineer: (looking for Kotlin/Java/Scala experience). You'll work across the backend: building enterprise integrations, large-scale scraping and parsing pipelines, and systems that let users apply LLMs to millions of documents to generate insights at scale.

AI Engineer: Applied AI plus strong software engineering. You will build, evaluate, and deploy LLM-driven features like deep document analysis and interactive chat, working with models from OpenAI, Anthropic, and Gemini. Expect hands-on Python, ML system design, experimentation, and production reliability. Bonus for RAG depth and frameworks like LangChain, LlamaIndex, or Hugging Face.

We're looking to build our in-person team in NYC but also open to remote!

Apply: https://starbridge.ai/careers and mention HackerNews or email recruiting@starbridge.ai with your resume.

  • C
  • C++
  • Go
  • Java
  • TypeScript
  • PostgreSQL
  • Machine Learning

Kinelo | kinelo.com | Full-Stack Engineer (AI + Backend) | San Francisco | Onsite

Kinelo manages the new type of hybrid team that will emerge in the future: humans+AI working shoulder-to-“shoulder”.

Our first product helps human software engineering teams deliver more efficiently and more reliably by assisting in coordination and engineering management. But it’s built on a foundational layer that ingests data, knowledge, and process across communications and systems, exposing that to humans and AI alike.

Eventually, Kinelo will become a proper human+AI orchestration layer, managing entire companies, and our goal is to have Kinelo run Kinelo (we already dogfood it today).

We’re looking for high ownership individuals to join our small team and shape its direction. We also just opened a new SF office and new hires will have the ability to help shape company culture, engineering practices, architecture, and roadmap.

We’re well-funded (just raised another round), founded by a serial founder, anti-bureaucratic, and highly technical.

We’re looking for:

- Those who want to go 0 to 1 with high ownership: moving features and products from idea on a napkin, to prototype, to engineering UI, to shipped and polished product

- ~8 years professional experience, including the pre-LLM days with strong CS fundamentals

- A balance of wisdom from experience and optimism about the future

- Strong TypeScript and Postgres (or other RDBMS) experience at scale and with high reliability (SaaS apps, APIs, etc)

- Experience with a variety of technologies, including containerization, distributed systems, and at least one strongly typed programming language (Go, C++, Java, etc)

- Ideally experience with ML and AI systems (though expertise in ML, data science, etc is not needed for this role)

We can’t sponsor visas but relocation assistance to SF is possible.

Please apply here: https://job-boards.greenhouse.io/kinelo/jobs/4088659009 and also email jobs@kinelo.com, mentioning this post.

Posted 13 days ago by deet

  • AWS
  • Machine Learning

Neoteny AI | Founding Researcher | REMOTE (US East Coast/Europe) / ONSITE (London, NYC)

Neoteny AI is building the sovereign intelligence layer for code. We are a team of builders from Meta, AWS, and NYU who believe that the future of coding intelligence is not a generalist chatbot in the cloud, but a specialized, repository-aware model that lives inside the enterprise perimeter.

We are looking for a founding researcher to lead our core research agenda. You will work at the intersection of representation learning, program synthesis, and efficient inference.

You will figure out how to teach models the latent physics of a codebase, the implicit architectural rules, dependency patterns, and style constraints that generalist models miss. Your work will range from designing new data engines to experimenting with novel architectures that break the memory wall of current transformers.

We are looking for someone with deep ML expertise (PhD or equivalent) who is code-fluent and comfortable writing kernels, not just training scripts.

Apply here: https://wellfound.com/recruit/jobs/3921270

  • Machine Learning
  • Full-time

Transfyr | Full Stack Software Engineer | Cambridge, MA | Full-time | ONSITE

Transfyr is building physical AI for science.

Why is it that a professional athlete has dramatically more information about every play they make than a scientist has about the cause of any experimental failure? At Transfyr, we are building the infrastructure to make real-world scientific work legible, transferable, and reproducible.

Modern science is capable of extraordinary outcomes, but much of the most important insights never become explicit: how experiments are actually executed, protocols drift, how experts make gametime decisions on the fly, why experiments fail on Tuesdays. This tacit knowledge is rarely captured, making it difficult to reliably reproduce results, much less hand off protocols to new team members or collaborators. We believe our systematic failure to capture tacit knowledge is holding back the entire industry.

We’re building systems that operate directly in real laboratory environments to elucidate, capture, and interpret this missing information. Our platform records and analyzes multimodal data about how scientific work is performed and turns it into durable, operational knowledge. In doing so, we are also building the world’s largest commercial dataset on real-world scientific execution.

This foundation is critical not only for driving elite human performance today, but for enabling meaningful automation tomorrow. Physical AI systems cannot learn from outcomes alone; they require rich, grounded records of how work is actually done in the real world.

Key domains of expertise we’re looking for (hiring from entry level to senior leadership):

- Computer Vision: take messy real-world images and video and make sense of them

- AI / ML Research & Engineering: build the best digital brains to support the best physical brains on the planet

- Full Stack Engineering: integrate our AI and data into tools that solve real customer problems.

A change from when I posted here previously: we are now focused on on-site in-person roles, at our lab/office in Cambridge, MA. And, we are particularly interested in DevOps/backend experience for the full stack role.

If you're interested, apply here! https://transfyr.ai/join-us

  • Android
  • iOS
  • Machine Learning
  • Full-time

Sesame | Full-time | SF/NYC/Bellevue | On-site | https://www.sesame.com/

Sesame believes in a future where computers are lifelike - with the ability to see, hear, and collaborate with us in ways that feel natural and human. With this vision, we're designing a new kind of computer, focused on making voice personal agents part of our daily lives. More details from Sequoia: https://www.sequoiacap.com/article/partnering-with-sesame-a-...

Our team brings together founders from Oculus and Ubiquity6, alongside proven leaders from Meta, Google, and Apple, with deep expertise spanning hardware and software.

Open Roles: https://jobs.ashbyhq.com/sesame

- ML Engineers

- Product Designers

- Product Managers

- iOS & Android Engineers

- ML Model Serving Engineer

- Embedded OS Architect

- Mechanical Engineer, Product Design

- Embedded Engineers

- Electrical Engineer

- Audio Systems Engineer

  • Elixir
  • Go
  • Python
  • GraphQL
  • PyTorch
  • React
  • Machine Learning
  • Full-time

River | Senior + Staff Engineers (Elixir, React, ML/AI) | SF or NYC or REMOTE (US, Europe, South America) | Full-Time | $150K-$250K + equity | https://jobs.ashbyhq.com/river

I'm Alex, founder and CEO of River. I also wrote the first line of code and still ship PRs.

River is a client-first Bitcoin-only financial institution building the financial app people use every day to save in bitcoin and spend in dollars. We custody over 25,000 BTC, serve individuals and businesses across the U.S., and are profitable. We publish cryptographic proof of reserves monthly and our company financials annually.

Here's what our engineering team is working on:

  - Real-time Bitcoin and USD payments, trading, and settlement on an Elixir monolith with a unified GraphQL API
  - Next-generation Bitcoin custody with quorum signing, key ceremonies, and geo-redundant infrastructure
  - Scaling our ML/AI systems (Python, PyTorch, XGBoost, LLMs) to automate operations and tackle fraud/risk
  - A ground-up React and React Native rewrite of our consumer app
  - Porting our Lightning Network infrastructure from Go into Elixir
We're hiring:

  - Senior Software Engineer (React, Full-stack) | $150K-$220K
  - Staff Software Engineer (Elixir) | $200K-$250K
  - Staff Software Engineer (ML/AI) | $200K-$250K
Apply at https://jobs.ashbyhq.com/river?utm_source=Zp9nJyqvd4 or email me at alex@river.com and mention HN.
  • Python
  • TypeScript
  • React
  • GCP
  • Machine Learning

Layer Health | https://www.layerhealth.com/ Amazing (and no ego) engineering team: https://www.linkedin.com/company/layerhealth/people/

Location: BOSTON (Boston Common) or NYC (downtown) - hybrid in both locations; <<NO REMOTE>> Headcount: ~38 and growing Stack:React / Typescript / Python / GCP

Open Roles (all roles require a MINIMUM of 6 years of work experience): Fullstack Engineer * ML Infrastructure (Staff+) * ML Engineer * ML/Research Scientist (PhD's) * Security Engineer * ML Engineering Manager (Director level)

We Offer: * competitive base salary + equity + great health/medical benefits * unlimited PTO * collaborative engineering-first culture

Layer Health was founded in 2023 by leading ML researchers from MIT and Harvard Medical School. We've built (and are live with multiple large health systems/hospitals) an ML product that scalably synthesizes patient medical records. Our LLM-powered platform is solving patient chart review - it dramatically accelerates clinical registry abstraction in areas ranging from surgery and cardiology, to oncology. Our long term vision is for our AI layer to safely transform patient care and minimize unnecessary heartbreak. Layer Health’s diverse founding team brings expertise across machine learning, UI/UX, large language models, and medicine.

Email me at mike.hauschild@layerhealth.com for more info