Opkey
Website: opkey.com
Job details:
About Opkey:
Opkey is the leading Cloud Application Lifecycle Management (CALM) platform for Oracle, Workday, Salesforce, Coupa, and more. It cuts the costs and risks that drag down implementations and ongoing change, helping you go live on time, get more from your cloud app investments, and reach AI readiness faster. Opkey's 20 AI agents manage all five phases of the cloud application lifecycle: Define, Design, Configure, Test, and Train.

Whether it's a new implementation, a platform update, or business-as-usual change, Opkey handles it all: updates validated in hours, self-healing tests, end-to-end integrations assured, configurations synced, and training updated in real time, all delivered in a single unified platform instead of a patchwork of disconnected tools.

Powered by Argus, a domain-specific AI model trained on decades of expertise and terabytes of enterprise application data, Opkey automates configuration, testing, change impact analysis, and training across these applications, cutting manual effort by 80%, enabling 30% faster go-lives, and slashing downtime risk by 92%.

Role: Cloud Engineer
Experience: 3-6 Years
Role Purpose: Own cloud infrastructure modules end-to-end, independently debug production issues, and participate in customer calls to troubleshoot, stabilize, and improve cloud-hosted environments.

Core Skills:

Cloud Production Operations (Primary)

AWS (Primary)
- EC2 production ops: instance lifecycle, AMIs, EBS volumes/snapshots, troubleshooting CPU/memory/disk/network issues, patching coordination, basic cost-aware sizing
- VPC: subnets (public/private), route tables, Internet Gateway/NAT Gateway concepts, NACLs vs Security Groups, VPC endpoints basics, peering basics
- S3: bucket policies, lifecycle rules, encryption basics, access troubleshooting (403/permissions), secure public access controls
- IAM: users/roles/policies, least privilege design, trust relationships, role assumption, access key hygiene, MFA enforcement awareness
- Route53: hosted zones, record types, routing policies basics, DNS troubleshooting (TTL/propagation)
- ALB: listeners, rules, target groups, health checks, sticky sessions basics, TLS termination patterns
- WAF basics: managed rules overview, allow/deny, common false-positive handling approach

Networking Basics (Required)
- DNS, ports, routing concepts, CIDR/subnetting basics
- TLS handshake basics, certificate chains, common connectivity failure patterns
- Load balancer traffic flow basics (client → LB → backend)

SSL / Certificate Management (Required)
- SSL installation & renewal (public, internal/self-signed where applicable)
- PFX/PEM/JKS awareness, SAN vs CN, intermediate chain handling
- Troubleshooting common cert issues (hostname mismatch, chain incomplete, expired certs)

Windows Web Hosting Basics (Required)
- IIS basics: sites/app pools, bindings, logs, common 502/503 troubleshooting
- ARR basics: reverse proxy fundamentals, routing rules, timeouts, headers, SSL offload basics

Secondary Skills:

Azure (Secondary)
- VM operations: sizing, disks, troubleshooting performance/connectivity
- VNet: subnets, NSGs, routing basics, private/public access patterns
- Storage: blob basics, access policies/SAS awareness, lifecycle basics
- RBAC: role assignments, scope (subscription/resource group/resource), troubleshooting access issues
- Application Gateway: listeners, backend pools, rules, health probes, TLS basics
- Azure WAF basics: managed rules awareness, request blocking patterns

Infrastructure as Code (Required)

Terraform (Mandatory, multi-cloud modules)
- Module usage/customization, variables/outputs, remote state basics
- Safe plan/apply workflow, drift awareness, environment separation (dev/stage/prod)
- Basics of state locking and rollback/recovery approach

Identity Basics (Required)
- SSO basics: SAML/OAuth concepts, metadata/cert rotation awareness, common misconfig patterns
- Coordinate with app/security teams for troubleshooting sign-in failures and token/cert issues

Good To Have:

OCI (Exposure)
- Compute production ops, VCN fundamentals, security lists, route tables
- IAM basics (policies/compartments), Load Balancer basics
- Vault basics (cert/secret storage) and operational awareness

Responsibilities:
- Own cloud infrastructure components end-to-end (provisioning, hardening, operations, troubleshooting)
- Handle production escalations independently with structured triage and RCA inputs
- Participate in customer calls for incident debugging, environment reviews, and remediation planning
- Ensure secure configurations: least-privilege IAM/RBAC, restricted network exposure, SSL correctness, WAF baseline protections
- Maintain operational readiness: runbooks, standard checks, and repeatable troubleshooting steps across environments

Please note: If you have changed 3 companies in 5 years, then we are not the right platform for you.
Click on Apply to know more.