Turn your passion into meaningful work
Joining Infomaniak means becoming part of a leading technology company where you will be surrounded by the best talent to create ethical and sovereign cloud and productivity solutions.
Infomaniak is the company behind SwissTransfer and a trusted partner for leading organizations: international institutions such as the United Nations, media outlets such as France Télévisions, iconic events such as the Montreux Jazz Festival and the Annecy Festival, as well as central banks, major cities and security organizations across Europe.
Infomaniak, an independent, B Corp certified company with awards for its data centers that push the boundaries of efficiency and energy recovery, is living proof that it's possible to build a different kind of digital world: sovereign, sustainable, and beneficial to the local economy. Here, your passion will become meaningful work: you'll grow with autonomy, take on real responsibilities, and contribute to projects that impact millions of people.
We are looking for a:
AI Engineer
Context :
Infomaniak develops an open-source AI platform hosted in its own Swiss data centers. We deploy language models at scale and build intelligent agents for our products (kChat, kMeet, kDrive). We are looking for an AI Engineer to design, implement, and optimize our AI agents, with a focus on quality, reliability, and user experience.
Your responsibilities:
- Design and development of AI agents with LangChain, Pydantic-AI, and RAG.
- Integration of LLMs into our products (e.g., collaborative assistant, automatic summarization, content generation).
- Optimization of prompts, agent chains, and pipelines (latency, cost, accuracy).
- Collaboration with backend and DevOps teams to deploy and monitor agents in production.
- Automated testing and performance evaluation of agents (metrics, user feedback).
- Technical documentation and sharing of best practices within the team.
The profile that excites us:
- Experience with LangChain and/or **pydantic-ai (**agents, chains, tools, memory, RAG).
- Experience with open source LLMs (e.g., Llama 3, Mistral, Qwen) and serving frameworks (vLLM, TGI).
- Knowledge of FastAPI or equivalent for exposing agents via API.
- A taste for quality of responses, security, and performance, and knowledge of Langfuse.
- Technical curiosity, a taste for innovative challenges and optimization.
- Ability to rigorously benchmark in order to select the most suitable model
- Ability to work in a critical environment (high SLA, high availability).
A plus if you have knowledge in:
- Experience with Docker/Kubernetes, GitLab CI, Prometheus/Grafana is a plus.
- Open-source contributions or side projects are welcome.
- You enjoy working in a team and demonstrate positive communication skills.
- Your humor, flexibility, and team spirit are essential assets for working in a fun environment.
The technical stack we use
- LangChain
- Pydantic-ai
- vLLM
- FastAPI
- Gitlab
- Sentry
- Qdrant
The position:
- Permanent contract
- Occupancy rate: 80-100%
- Location: Geneva
- Availability: As soon as possible
The steps in the recruitment process:
- An initial technical interview to validate your skills.
- A second interview in our offices