We are looking for a DevOps / System / IT / Infrastructure Engineer to manage our hybrid on‑prem and cloud environment, keep our systems secure and highly available, and drive automation and AI-assisted operations across the organization. You will work closely with development, QA, and business teams to ensure robust infrastructure, smooth releases, and fast incident resolution.
Key responsibilities
-
Hands-on administration of servers, desktops, laptops, printers, routers, switches, firewalls, and IP phones in a mixed on‑prem and cloud environment.
-
Manage and optimize cloud infrastructure on AWS and/or Azure, including compute, storage, networking, security groups, and IAM policies.
-
Set up, configure, and maintain file servers, VPNs, firewalls, LDAP, and Active Directory/Entra ID for secure access and identity management.
-
Implement and maintain CI/CD pipelines (GitLab/Jenkins or similar) for application build, test, and deployment automation.
-
Monitor infrastructure, applications, and networks using modern observability tools; set up logging, alerting, and dashboards for proactive issue detection.
-
Perform regular OS upgrades, patching, performance tuning, backup, and disaster recovery for critical systems and services.
-
Troubleshoot and resolve production incidents, perform root cause analysis, and implement long-term fixes to improve reliability.
-
Collaborate with developers and QA to design scalable environments, improve deployment workflows, and standardize environments across dev, staging, and production.
AI, AIOps, and modern trends
-
Leverage AI-assisted tools (AIOps platforms, AI copilots, log intelligence) to speed up incident triage, root cause analysis, and problem resolution.
-
Use AI and automation to optimize cloud costs, capacity planning, and resource utilization across environments.
-
Implement or work with Infrastructure as Code (IaC) tools (Terraform, CloudFormation, ARM/Bicep, etc.) for consistent, repeatable provisioning of infrastructure.
-
Contribute to the evolution of cloud-native, containerized, and microservices-based architectures (Docker/Kubernetes or similar) where applicable.
-
Continuously evaluate and adopt DevOps trends such as GitOps, policy-as-code, progressive delivery, and enhanced security automation.
Required skills and experience
-
Solid experience in system and network administration, including Windows and/or Linux servers, desktop environments, and core office IT infrastructure.
-
Hands-on experience with AWS and/or Azure cloud services (compute, storage, networking, IAM, security, monitoring).
-
Practical knowledge of CI/CD tools (Jenkins, GitLab CI, GitHub Actions, etc.) and build/deployment automation.
-
Experience with scripting (Bash, PowerShell, Python) for automation, tooling, and integration with APIs.
-
Good understanding of security best practices, including firewalls, VPNs, endpoint security, identity/access management, and compliance basics.
-
Familiarity with monitoring, logging, and alerting tools (e.g., Prometheus, Grafana, CloudWatch, ELK, or similar).
-
Exposure to or strong interest in AI-driven operations (AIOps) and using AI tools to enhance reliability, automation, and support.
Nice to have
-
Experience with Infrastructure as Code (Terraform, CloudFormation, ARM/Bicep, Ansible, etc.).
-
Experience with containers and orchestration (Docker, Kubernetes, ECS, AKS, or EKS).
-
Background in supporting web, mobile, or SaaS applications in production environments.
Soft skills
-
Strong ownership mindset with the ability to independently drive tasks from start to finish.
-
Excellent problem-solving and debugging skills, especially under time pressure during incidents.
-
Clear communication and collaboration with developers, QA, management, and non-technical teams.