Data Science Engineer (W/M)

  • EPFL
  • Lausanne, VD, Switzerland
  • 19/06/2021
Full time Data Science Data Analytics Big Data Data Management Statistics

Job Description

About us

EPFL, the Swiss Federal Institute of Technology in Lausanne, is one of the most dynamic university campuses in Europe and ranks among the top 20 universities worldwide. The EPFL employs more than 6,000 people supporting the three main missions of the institutions: education, research and innovation. The EPFL campus offers an exceptional working environment at the heart of a community of more than 16,000 people, including over 12,000 students and 4,000 researchers from more than 120 different countries.

The Swiss Data Science Center is a joint venture between EPFL and ETH Zurich. Its mission is to accelerate the adoption of data science and machine learning techniques within academic disciplines of the ETH Domain, the Swiss academic community at large, and the industrial sector. In particular, it addresses the gap between those who create data, those who develop data analytics and systems, and those who could potentially extract value from it. The center is composed of a large multi-disciplinary team of data and computer scientists, and experts in select domains, with offices in Lausanne and Zurich www.datascience.ch

Data Science Engineer (W/M)

Your mission :
We are seeking enthusiastic and experienced candidates with scientific IT expertise and a proven track record in and around data science and analytics on large-scale distributed platforms, services and applications, to staff up their national R&D Swiss Data Science Center. The ideal candidate will become part of the Swiss Data Science Center and will act as an enabler of data science activities within the research community from the ETH domains, Swiss universities, and the industry. In this role, you will:

  • liaise with data providers, data scientists, domain scientists, and industry partners,
  • understand goals, gather requirements, implement solutions,
  • ensure knowledge transfer between stakeholders.

Main duties and responsibilities include :

  • Liaise with data providers, data scientists, domain scientists, and industry partners to gather requirements.
  • Design, develop and set up novel (big) data science solutions into industrial and academic environments, using state of the art data science frameworks.
  • Prepare tutorials, presentations, blogs, publications, about data science technologies.
  • Provide trainings and promote the technology and services offered by the SDSC, in particular in connection with the Renku platform (https://renkulab.io/)

Your profile :

  • A bachelor’s degree (MSc or higher preferred) in computer science or a related discipline (e.g. statistics, bioinformatics, physics, mathematics).
  • A proven track record of crafting innovative and elegant software solutions, and a good command of the Python programming language. Familiarity with another programming/scripting languages, in particular R and bash, is a strong plus.
  • Previous experience applying machine learning and (big) data analytics frameworks such as TensorFlow, Pytorch, Scikit-learn, Tidymodels, and the Apache Hadoop ecosystem, to real world problems.
  • Familiarity with software development best practices, such as agile software development and CI/CD, and tools like Git as well.
  • Familiarity with Semantic Web Technologies, such as RDF, SPARQL, OWL, SHACL, is desirable.
  • Consistent experience with the Linux operating system. Experience with cloud technology, and containers like Docker or Kubernetes, are highly desirable.
  • Ability to work well in a cross-functional environment and excel in communicating with your peers.
  • An interest to explore and learn novel technologies and put them to practice in uncharted territories.
  • Excellent command of the English language, both verbal and written (required). Good working knowledge of French or German, would be a plus.

We offer :
We offer you a stimulating, startup-like, cross-disciplinary environment in a world-class research center that is part of two leading universities. In this dynamic position, you will make full use of your data science engineering and research skills and creativity to develop novel solutions for real cutting-edge questions. You will push forward the capabilities and performance of the team, contribute to decision-making about the direction of the SDSC platforms and investigate available technology options. You will work in a data science setting alongside leading domain and computer science experts from the ETH domain as well as industry. We have excellent ties to research groups worldwide, both academic and industrial. You will get access to state-of-the-art infrastructure and resources. Remote working is possible, although strictly from Switzerland
About Renku
Many data science projects today struggle to be efficient and reproducible. It is difficult to identify available data, and then even more to share it; those who share data are often not recognized for their contribution; it is a challenge to keep track of data versions; it is hard to see what code and data were used by whom to produce what results. Renku (https://datascience.ch/renku/, https://renkulab.io/) is an open collaborative platform developed by the SDSC to address these problems. Renku provides a knowledge infrastructure that seamlessly integrates interactive sessions (such as Jupyter, RStudio), automatic provenance tracking (which results were produced by whom and when), GitLab CI/CD, as well as version control systems for code, data and containerised environments. The key strength of Renku is its knowledge graph that captures the provenance of the analysis process by connecting versioned research objects, thus ensuring computational reproducibility. Renku makes it possible to have greater trust in results and acknowledge the contributions of all those involved, regardless of whether their contribution was to implement the solution, provide the data, or ask the right questions.Applications via email or postal services will not be considered. For further information about the Swiss Data Science Center please visit our website: www.datascience.ch. Questions regarding the position should be directed to OksanaRiba Grognuz (oksana.riba@datascience.ch) (no applications).

Start date :
As soon as possible

Term of employment :
Fixed-term (CDD)

Duration :
1 year, renewable

Remark :
Only candidates who applied through EPFL website or our partner Jobup’s website will be considered. Files sent by agencies without a mandate will not be taken into account