Infrastructure Operations Engineer (GPU Computing)
Aethir
Overview
Role involves managing and optimizing GPU-based compute infrastructure for performance and reliability.
remotefull-timeEnglishLinuxAnsibleChefPuppetGITDockerKubernetesAWSAzureGCP+ 1 more
Locations
Requirements
Experience in infrastructure operations Proficiency in managing GPU-based compute infrastructure Strong expertise in Linux system administration Experience with configuration management tools Familiarity with containerization and orchestration technologies Excellent analytical and problem-solving skills Effective communication skills Experience with cloud computing platforms
Responsibilities
Manage and optimize GPU-based compute infrastructure Implement monitoring and alerting systems Develop automation scripts and tools Enforce security best practices Provide tier-3 support for infrastructure issues Collaborate on capacity planning Maintain documentation and knowledge sharing
Benefits