Principal Engineer, Cloud Engineering
The Principal Engineer, Cloud Engineering is a hands-on technical leader responsible for architecting, building, maintaining, and operating automated cloud infrastructure that power ICIMS products. This role includes Linux System Administration, Infrastructure-as-Code (IaC) using Terraform, and configuration management with Ansible across AWS. The Principal Systems Engineer sets best practices, mentors engineers, and partners with SRE, Security, and Product teams to deliver resilient, secure, and cost-efficient platforms at scale.
About Us
ICIMS is a leading enterprise hiring platform that combines the scale and reliability of enterprise software with the transformative power of AI. Thousands of organizations across more than 200 countries and territories trust ICIMS to find and hire the people who shape their future and drive their business forward. Powered by insights from billions of hiring interactions, continuous AI innovation, and a highly extensible platform, ICIMS helps organizations turn talent acquisition into a competitive advantage. For more than 25 years, ICIMS has delivered end-to-end hiring solutions that improve recruiting efficiency, reduce costs and create exceptional candidate experiences.
ICIMS helps solve one of the biggest challenges businesses face today: building a workforce that can adapt, scale, and perform in an increasingly competitive and unpredictable talent market. We uniquely do that by combining enterprise-grade hiring technology, AI-powered insights and automation, and connected talent experiences to help organizations improve hiring outcomes while driving measurable impact.
Responsibilities
- Administer and maintain Linux servers across development, staging, and production environments
- Perform OS hardening, patching, and security updates in compliance with organizational standards
- Manage user accounts, permissions, and groups following least-privilege principles
- Configure and maintain system services (e.g., SSH, NTP, DNS, logging) for reliability and security
- Monitor system performance, troubleshoot issues, and optimize resource utilization
- Implement automated configuration management using Ansible for consistency and drift prevention
- Develop scripts (Bash/Python) to automate routine tasks such as log rotation, backups, and health checks
- Collaborate with DevOps and SRE teams to integrate Linux systems into CI/CD pipelines
- Infrastructure-as-Code development in Terraform and code reviews
- Establish observability (logs, metrics, traces) and proactive alerting; drive SRE practices and incident response
- Participate in on-call rotations; perform post-incident reviews and continual improvement
- Collaborate with FinOps on right-sizing, capacity planning, and cost optimization
Security & Compliance Integration
- Enforce OS-level security hardening standards (e.g., CIS benchmarks) across all Linux servers.
- Support vulnerability management and automated patching processes
- Ensure compliance with organizational security policies and regulatory requirements (SOC2, ISO27001)
- Conduct periodic security reviews and participate in internal audits for Linux environments
- Manage secure credential storage and rotation for system-level accounts and services
Qualifications
Technical Expertise
- 7+ years in Linux administration and production support experience with on-calll responsibilities
- 5+ years of strong AWS experience and demonstrated knowledge
- Expertise in Terraform and Ansible for configuration management
- Hands-on experience with containers and orchestration (ECS/EKS)
- Scripting skills in Bash and/or Python for automation and tooling
- Solid understanding of networking (DNS, TLS, HTTP, routing, firewalls) and security fundamentals
- Observability tooling experience
- Some knowledge of Jenkins (administration, pipeline jobs, shared libraries)
Preferred
- Experience with AWS Organizations and multi-account governance
- AWS professional-level certifications (e.g., Solutions Architect, DevOps Engineer)
- Exposure to other database technologies or data platforms; MSSQL, MongoDB, MySQL
- Exposure to other cloud technologies like Azure and GCP
EEO Statement
iCIMS is a place where everyone belongs. We celebrate diversity and are committed to creating an inclusive environment for all employees. Our approach helps us to build a winning team that represents a variety of backgrounds, perspectives, and abilities. So, regardless of how your diversity expresses itself, you can find a home here at iCIMS.
We are proud to be an equal opportunity and affirmative action employer. We prohibit discrimination and harassment of any kind based on race, color, religion, national origin, sex (including pregnancy), sexual orientation, gender identity, gender expression, age, veteran status, genetic information, disability, or other applicable legally protected characteristics. If you would like to request an accommodation due to a disability, please contact us at [email protected].
Compensation and Benefits
iCIMS offers a comprehensive benefits package: medical, vision, and dental, life insurance, pensions, tuition reimbursement, lifestyle spending accounts, and generous paid time off plans (vacation, sick, parental leave). Learn more here: https://careers.icims.com/benefits
Qualifications:Technical Expertise
- 7+ years in Linux administration and production support experience with on-calll responsibilities
- 5+ years of strong AWS experience and demonstrated knowledge
- Expertise in Terraform and Ansible for configuration management
- Hands-on experience with containers and orchestration (ECS/EKS)
- Scripting skills in Bash and/or Python for automation and tooling
- Solid understanding of networking (DNS, TLS, HTTP, routing, firewalls) and security fundamentals
- Observability tooling experience
- Some knowledge of Jenkins (administration, pipeline jobs, shared libraries)