Signed NDA
Site Reliability Interview Questions
2,640 site reliability interview questions shared by candidates
Linux questions: how, and with which commands, would you investigate a VM with no SSH access? Step through the process.
See above
1) HR Interview - A few minutes of introductions, plus some questions about my current employer, my work status in the US, and so on.
2) Technical Interview - A 30-minute phone interview with the manager. - Questions on my resume about my job responsibilities as a Cloud Operations Engineer, the monitoring tools I was working on, and the role I played in setting them up.
3) On Site Interview - Interviewed by HR, a team member, the manager, the director, and two other members of the Cloud team (around 3-4 hours). - Questions on my resume: skills in Python and AWS, and the monitoring tool I was working on at my current company (its architecture, functionality, and so on).
The following thing failed in k8s. How do you debug it?
You are given a game that runs on a single instance with MySQL installed locally. As the game's popularity increases, suggest ways to keep it highly secure and highly available. With every step the interviewer added more requirements, for example: we want to use JWT, should we? How would you handle session maintenance? And so on.
How would you build an app that has to upload images? What if it had to do this? What if it had to do that? Where would the data go in the database? Code extra features into existing software across several files, etc.
Tell me about yourself
Usual HR screening questions, nothing especially memorable.
Years of experience working as an SRE Engineer or in a very similar role.
Years of experience working with cloud (AWS).
Years of experience working with IaC tools (Terraform) and GitOps CI/CD solutions (ArgoCD, GitHub Actions, or similar).
Years of experience working with open-source monitoring and logging tools such as Grafana, Prometheus, Elastic/OpenSearch, Loki, Tempo.
Years of experience working in Kubernetes, including its core components, deployment methodologies, and monitoring best practices.
Strong scripting abilities (Python, Go, or similar) for automating observability tasks.
Experience in managing observability: SLIs, SLOs, Log Transformation, Cardinality Management, Business and Resilience Metrics, 4 Golden Signals, Distributed Tracing.
Experience with automated alerting workflows.
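For anyone preparing for the SLI/SLO part of a list like this, here is a minimal sketch of the usual error-budget arithmetic for a request-based availability SLO. The function names and the sample numbers are hypothetical and are not taken from any interview; this is one common way to frame the calculation, not a definitive method.

```python
def availability_sli(total_requests: int, failed_requests: int) -> float:
    """Availability SLI: fraction of requests that succeeded."""
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    return 1.0 - failed_requests / total_requests


def error_budget_remaining(slo_target: float, total_requests: int,
                           failed_requests: int) -> float:
    """Fraction of the error budget still unspent; negative if the SLO is blown.

    slo_target is the objective as a fraction, e.g. 0.999 for "three nines".
    """
    if not 0.0 < slo_target < 1.0:
        raise ValueError("slo_target must be strictly between 0 and 1")
    if total_requests <= 0:
        raise ValueError("total_requests must be positive")
    # The budget is the number of failures the SLO tolerates in this window.
    allowed_failures = (1.0 - slo_target) * total_requests
    return 1.0 - failed_requests / allowed_failures


if __name__ == "__main__":
    # Hypothetical window: 1,000,000 requests, 250 failures, 99.9% SLO.
    print(availability_sli(1_000_000, 250))
    print(error_budget_remaining(0.999, 1_000_000, 250))
```

With the hypothetical numbers above, the SLO tolerates about 1,000 failures, so 250 failures leave roughly three quarters of the budget unspent. An alerting workflow would typically fire on how quickly this value drops rather than on the raw error count.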