Implement monitoring solutions to track system health, performance, and availability. Proactively monitor systems, identify issues, and respond to incidents promptly, working to minimize downtime and mitigate impacts
SREs drive continuous improvement efforts by identifying areas for enhancement, implementing best practices, and fostering a culture of reliability engineering. Participate in post-mortems, conduct blameless retrospectives, and drive initiatives to improve system reliability, stability, and maintainability
SREs collaborate closely with software engineers, operations teams, and other stakeholders to ensure smooth coordination and effective communication. They share knowledge, provide technical guidance, and contribute to the development of a strong engineering culture
...
Proactive Innovation: You’ll build and maintain advanced monitoring frameworks to catch potential issues before they ever impact operations.
Collaborative Scaling: Work closely with cross-functional teams to translate complex business needs into scalable, maintainable technical reality.
As our Senior Developer, you are the guardian of system reliability. You will lead the enhancement of our custom big data application, moving beyond surface-level fixes to implement long-term, high-impact architectural improvements.