Software Engineer Reinforcement Learning

Location

Zurich or Remote (EMEA)

Salary

55000 - 95000 a year (s)

Description

Employment Type: 6 Month Contract

We are looking for a Software Engineer with a focus on data preparation and AI model training. You will work on assembling, annotating, and cleaning training data, while contributing to reward modeling and supervised fine-tuning tasks.


You might thrive in this role if you:

  • Have a deep understanding of machine learning and machine learning applications.
  • Working knowledge and experience tuning large language models (multimodal) and building evaluations.
  • Be willing to dive into large codebases to debug.
  • Someone who thrives in a dynamic and technically complex environment.
  • Track record of delivering outside-the-box novel solutions to solve real-world constraints.

Responsibilities

  • Data Assembly & Annotation: Gather and annotate training data for AI models, ensuring it meets the quality requirements for reward modeling and supervised fine-tuning.
  • Data Cleaning & Processing: Conduct data cleaning and preprocessing to ensure models receive high-quality input.
  • Model Training: Participate in the training and fine-tuning of models, ensuring that they meet performance and accuracy standards.
  • Collaboration: Work with AI engineers, data scientists, and other team members to ensure efficient workflows and data handling.
  • Continuous Improvement: Support iterative improvements to models based on performance monitoring and feedback.

Requirements

  • Experience: At least 3 years of experience working in a software engineering role focused on AI/ML tasks.
  • Data Expertise: Hands-on experience assembling, annotating, and cleaning training data for machine learning models.
  • Technical Skills: Proficiency in Python and experience with AI frameworks like TensorFlow or PyTorch.
  • Model Training: Familiarity with model training, reward modeling, and supervised fine-tuning techniques.
  • Attention to Detail: Strong focus on data quality and attention to detail when handling large datasets.

Bonus Points

  • Experience working with reward modeling for AI systems.
  • Familiarity with data labeling tools and techniques for supervised fine-tuning.
  • Knowledge of cloud platforms for AI/ML workloads.

About DFINITY and the Internet Computer:

DFINITY is a leading contributor to the Internet Computer Protocol (ICP), with a mission to bring the world's compute onto the secure ICP network. Built on its unique third-generation blockchain technology, ICP enables the development and operation of a new generation of unstoppable, tamper-proof, fully decentralized web applications. Its powerful technology can run entire AI models within smart contracts, representing a major advancement for secure AI. Through seamless integration with Bitcoin, Ethereum, and other networks, ICP facilitates multi-chain operations for digital assets and web3.

Join our team of over 250 talented individuals, including world-renowned cryptographers, distributed systems engineers, programming language experts, and industry leaders, who are shaping the future of the internet and web3.
DFINITY was founded in 2016 by entrepreneur and crypto theoretician, Dominic Williams.

All qualified applicants will receive consideration for employment without regard to race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, disability, age, or veteran status.


Please mention the word **EFFECTIVENESS** and tag RMjYwMDoxOTAwOjA6NDMwMzo6ZDAw when applying to show you read the job post completely (#RMjYwMDoxOTAwOjA6NDMwMzo6ZDAw). This is a beta feature to avoid spam applicants. Companies can search these words to find applicants that read this and see they're human.

Job type:

Remote job

Tags

  • software
  • crypto
  • python
  • training
  • support
  • web
  • cloud
  • assembly
  • operations
  • engineer
  • engineering
Sent 26 days ago
Back to index