Machine Learning Ops Engineer (remote)  (VAC-C20156M)

Τομέας/Κατηγορίες Εργασίας: Πληροφορική – Τηλεπικοινωνίες
Τύπος Απασχόλησης: Πλήρης Απασχόληση
Τοποθεσία: Κύπρος, Λεμεσός
Δημοσιεύθηκε πριν από: 3 εβδομάδες
Λήγει σε 2 εβδομάδες
Reference Number: FJ121355

Περιγραφή Θέσης

We are looking for an MLOps Engineer to join the growing team of a Limassol company that provides total brokerage solutions. This is a lead role for the DevOps environment. The responsibilities include both managing and building processes for automation as well as contributing to the development of internal tools to achieve operational efficiency. 100% remote work is possible.

 

Responsibilities:

  • Maintain the infrastructure clusters
  • Maintain models training infrastructure 
  • Deploy and maintain Kubeflow infrastructure
  • Design and implement alerts system for models quality and AI services availability
  • Deploy and maintain hyper-parameters tuning infrastructure
  • Prioritizing requests from the team fairly while demonstrating a sense of empathy
  • Maintain and enhance the CI/CD pipelines 
  • Collaborate with data engineering team to support production grade AI system
  • Develop automation flows that enable fast delivery and replace manual operating procedures wherever they exist to enable self-service operations
  • Drive analysis, design, and development of automation tools for deployment, development, and operational tasks
  • Deploy & manage monitoring/observability infrastructure for staging & production
  • Collaborate with DevOps team to enhance common infrastructure

 

Requirements:

  • 2+ years’ experience within hands-on technical DevOps/Cloud engineering
  • Good knowledge of Python or Golang
  • Experience with Kubernetes deployment patterns and tools such as Helm, Kustomize and Operators
  • Experience utilizing DevOps tool chains including Jenkins, Docker, SonarQube, GitHub
  • Experience with tools used for observability such as Elasticsearch, Kibana, Grafana, Prometheus, Jaeger etc.
  • Experience with SQL & NoSQL databases such as PostgreSQL and MongoDB
  • Experience with event steaming tools (i.e. Apache Kafka) and architecture patterns
  • Exposure to Agile environments (use of Jira/Confluence, sprints, etc.)
  • Good understanding of Machine Learning project life-cycle
  • Great communication skills and team player mentality

 

Desirable:

  • Experience with production grade machine learning systems
  • Advanced knowledge of Fairing frameworks or Kubeflow
  • Experience with development of custom Kubernetes operators
  • Experience with AutoML infrastructure
  • Infrastructure as Code experience (Terraform, CloudFormation, etc.)
  • Experience with Azure public clouds is a plus
  • Understanding of network engineering and security principles (e.g. protocols, routing, switching, filtering, firewall rules, etc.)

 

What we offer:

  • Challenging and engaging tasks
  • Professional growth opportunities
  • Flexible work and leave schedules
  • A competitive salary with an incentive program that rewards and recognises outstanding performance
  • Opportunity to work in an open and collaborative environment
  • Team bonding events

 

TO APPLY for this job opportunity, send your CV (in English please) to [email protected] and include the reference:  Machine Learning Ops Engineer (remote) – VAC-C20156M. We look forward to hearing from you!

 

Κύπρος, Λεμεσός