Aethir

Infrastructure Operations Engineer (GPU Computing)

Aethir

Overview

Role involves managing and optimizing GPU-based compute infrastructure for performance and reliability.

remotefull-timeEnglishLinuxAnsibleChefPuppetGITDockerKubernetesAWSAzureGCP

Locations

  • United States

Requirements

  • Experience in infrastructure operations
  • Proficiency in managing GPU-based compute infrastructure
  • Strong expertise in Linux system administration
  • Experience with configuration management tools
  • Familiarity with containerization and orchestration technologies
  • Excellent analytical and problem-solving skills
  • Effective communication skills
  • Experience with cloud computing platforms

Responsibilities

  • Manage and optimize GPU-based compute infrastructure
  • Implement monitoring and alerting systems
  • Develop automation scripts and tools
  • Enforce security best practices
  • Provide tier-3 support for infrastructure issues
  • Collaborate on capacity planning
  • Maintain documentation and knowledge sharing

Benefits

  • Competitive compensation
  • Flexible work hours
  • Remote work options
  • Flexible benefits
  • Flexible salary