Senior Operations Engineer - Product Operations
Full Time · Product Operations · On-Site
Cashfree HSR Office
Cashfree is seeking a highly skilled Senior Operations Engineer - Product Operations to join our team. As a Senior Operations Engineer, you will be responsible for ensuring the smooth operation of our products and services, identifying and resolving technical issues, and implementing process improvements to enhance overall efficiency.
This is an exciting opportunity for an experienced engineer who is passionate about operations excellence, problem-solving, and leadership. If you are a motivated and results-driven individual with a strong background in engineering and operations, we encourage you to apply.
Familiarity with SQL, Excel, and Google Sheets to analyze and present data-driven insights.
Understanding of monitoring and observability tools such as Prometheus, Grafana, and Datadog to ensure system visibility and performance.
Strong experience in analyzing problems, proposing multiple solutions, and recommending and implementing solutions that prevent performance degradation.
Excellent communication and problem-solving skills to resolve issues while balancing conflicting priorities.
A strong sense of responsibility, good learning ability, self-drive, and team spirit to thrive in a fast-paced and dynamic environment.
Ability to accumulate best practices in operations and maintenance, guide the optimization of operations processes, and contribute to process documentation.
4+ years of relevant experience in operations engineering or a related field.
Incident Response: Rapidly assess and resolve critical incidents, engaging with cross-functional teams to restore system functionality and maintain business continuity.
Monitoring and Alerting: Build and optimize monitoring systems to enhance visibility and enable timely detection of potential issues, reducing the impact of downtime and performance degradation.
Bank / PG Relationship: Maintain and foster relationships with Payment Gateway providers (PGs) and banks for issue resolution, ensuring timely and efficient communication and response.
Automation: Design and implement infrastructure and tooling automation to streamline alert monitoring processes and reduce manual interventions, freeing up time for more strategic tasks.
Operations Excellence: Accumulate best practices in operations and maintenance, guide the optimization of operations processes, and contribute to process documentation to ensure consistency and efficiency.
Proactive Problem Solving: Proactively identify potential risks and bottlenecks, recommending and implementing solutions to prevent performance degradation and ensure smooth operation.
Reporting: Compile and deliver daily and weekly reports on product metrics and incident metrics to provide visibility into system health and performance, enabling data-driven decision-making.