DevOps Engineer
Founded in 2012, H2O.ai is at the forefront of the AI movement to democratize Generative AI. H2O.ai’s open-source Generative AI and Enterprise h2oGPT, combined with Document AI and the award-winning autoML Driverless AI, have transformed more than 20,000 global organizations and over half of the Fortune 500 and household brands, including AT&T, Commonwealth Bank of Australia, Chipotle, ADP, Workday, Progressive Insurance, and AES.
Our “AI for Good” program supports nonprofit groups, foundations, and communities in their efforts to advance education, healthcare, and environmental conservation, including identifying areas vulnerable to natural disasters and protecting endangered species.
We have a vibrant community of two million data scientists worldwide and aim to bring together the world’s top data scientists with customers to co-create GenAI applications that are usable and valuable by everyone. Business users can now leverage the power of LLMs to enhance productivity with enterprise applications.
H2O.ai is a Visionary in the 2024 Gartner® Magic Quadrant™ for Data Science and Machine Learning Platforms. We are the only provider in the market to offer both Predictive AI and Generative AI on premise and air gapped, in addition to supporting all cloud environments.
About This Opportunity
As a Senior DevOps Engineer at H2O.ai, you will be responsible for the deployment and operations of our core products, including H2O.ai Cloud. In this role, you will work on improving and extending automation, installation, and deployment processes while handling complex customer queries related to platform deployment and integration. You will collaborate with sales, customer success, and technical teams to ensure smooth deployment, operations, and ongoing customer satisfaction.
Your role will also involve engaging with potential and existing customers, providing pre-sales technical support, addressing advanced technical inquiries, and helping them understand the capabilities and advantages of the H2O.ai platform. By demonstrating deep technical expertise and solving complex deployment issues, you will help accelerate customer adoption and success. The role requires to be physically working in customer location in Singapore.
What You Will Do
- CI/CD Pipeline Automation: Establish and maintain CI/CD pipelines using Jenkins or GitHub Actions to automate software build, test, and deployment processes. Ensure efficient and reliable deployment to streamline platform integration for customers.
- Design, Deploy, and Manage Infrastructure: Architect, deploy, and manage infrastructure for H2O.ai Cloud across major cloud platforms (AWS, Azure, GCP) and on-premises Kubernetes (K8s) clusters. Set up and configure servers, networks, and storage to support the deployment process and ensure platform scalability.
- Monitoring and Performance Optimization: Set up and configure monitoring systems (e.g., Datadog) to track system performance and optimize resource utilization. Troubleshoot issues by analyzing logs and proactively ensuring platform stability.
- Disaster Recovery & Security: Implement security best practices to protect customer data and ensure compliance with industry regulations (e.g., SOC2, HIPAA). Define and implement disaster recovery strategies for critical customer environments.
What We Are Looking For
- Experience: Minimum 5+ years of relevant experience, with at least the last 3+ years focused on working with Jenkins for CI/CD pipeline development.
- CI/CD Tools: Hands-on experience with Jenkins and/or GitHub Actions or related CICD tools.
- Coding Skills: Strong expertise in Groovy, Python and/or shell scripting. Additional knowledge of Go is a plus.
- Kubernetes Expertise: Proven experience deploying applications on Kubernetes and a strong understanding of container orchestration.
- Helm Charts: Ability to understand and write Helm Charts for Kubernetes deployments.
- Cloud Platforms: Proficiency in AWS, Azure, and GCP, as well as on-premises infrastructure.
- Tools and Automation: Familiarity with the tools used/planned for automating H2O deployments and integrations with databases, platforms such as - hive, impala, cloudera, YAML configuration files for automation workflows and deployment orchestration
- Linux Expertise: Familiarity with Ubuntu and CentOS/RHEL environments.
- Monitoring and Performance: Experience with tools like Datadog for monitoring and troubleshooting.
- Customer-Facing Skills: Strong communication and problem-solving skills to address customer queries and foster adoption of H2O.ai solutions.
- Virtualization: Familiarity with technologies like VMware and Vagrant.
Why H2O.ai?
- Market Leader in Total Rewards
- Remote-Friendly Culture
- Flexible working environment
- Be part of a world-class team
- Career Growth
H2O.ai is committed to creating a diverse and inclusive culture. All qualified applicants will receive consideration for employment without regard to their race, ethnicity, religion, gender, sexual orientation, age, disability status or any other legally protected basis.
H2O.ai is an innovative AI cloud platform company, leading the mission to democratize AI for everyone. Thousands of organizations from all over the world have used our cutting-edge technology across a variety of industries. We’ve made it easy for people at all levels to generate breakthrough solutions to complex business problems and advance the discovery of new ideas and revenue streams. We push the boundaries of what is possible with artificial intelligence.
H2O.ai employs the world’s top Kaggle Grandmasters, the community of best-in-the-world machine learning practitioners and data scientists. A strong AI for Good ethos and responsible AI drive the company’s purpose.
Please visit www.H2O.ai to learn more.