SRE Architect

Location: Chicago, Illinois, United States
Job ID: 1030221DS
Date Posted: Sep 4, 2024
Segment: Digital System & Service
Business Unit: Hitachi Services & Platforms
Company Name: Hitachi Digital Services
Profession (Job Category): IT, Telecom & Internet
Job Type (Experience Level): Experienced
Job Schedule: Full-time

Our Company

We're Hitachi Digital Services, a global digital solutions and transformation business with a bold vision of our world's potential. We're people-centric and here to power good. Every day, we future-proof urban spaces, conserve natural resources, protect rainforests, and save lives. This is a world where innovation, technology, and deep expertise come together to take our company and customers from what's now to what's next. We make it happen through the power of acceleration.

Imagine the sheer breadth of talent it takes to bring a better tomorrow closer to today. We don't expect you to 'fit' every requirement; your life experience, character, perspective, and passion for achieving great things in the world are equally important to us.

The role
  • Lead one of the key programmes for the Hitachi Application Reliability Centre across Observability, FinOps, Service Management, DevSecOps, and Resiliency & Reliability engineering.
  • Lead the identification of process and infrastructure gaps, and the design and implementation of process improvements to increase operational reliability.
  • Identify and apply functional and non-functional improvements: acts as the overall Operations representative in Value Stream planning and prioritisation sessions to ensure that the operational needs of assigned applications/platforms are addressed.
  • Holds quarterly Operational Performance reviews with Value Stream management.
  • Create and review technical requirements and translate them into platform enablers in features/user stories.
  • Increase operational efficiency and promote stability through automation from a domain perspective.
  • Design and drive monitoring, alerting, and ticket reporting strategies to measure SLAs, SLOs, and metrics such as MTTR, MTTD, and MTTA, and align with management expectations to improve application resiliency and reliability.
  • Develop dashboards for alerting and monitoring to ensure the reliability and availability of application systems and services.
  • Write quality code using SOLID principles and Test-Driven Development in more than two programming languages.
  • Operational Performance & Stability: Works with other members of their assigned Value Stream to ensure that the in-scope applications/platforms are meeting performance and stability requirements. This includes managing Major Incidents to Mitigation/Resolution.
  • Problem Management: Performs Post-Incident Reviews of all Major Incidents and determines the Action Items required to avoid similar issues and minimize downtime in future Incidents.
  • Monitoring and Metrics: Works with Application Development to ensure that assigned applications/platforms have the appropriate monitoring and metrics in place to appropriately measure
What you'll bring
  • Proven experience in SRE, DevOps, or a related field with a focus on operational reliability, automation, and performance management.
  • Strong leadership skills with experience in managing cross-functional teams and leading large-scale programs.
  • Proficiency in multiple programming languages and a strong understanding of SOLID principles and Test-Driven Development.
  • Extensive experience with monitoring and alerting tools, as well as developing dashboards to track key performance indicators.
  • In-depth understanding of SLAs, SLOs, MTTR, MTTD, and MTTA, and experience in designing strategies to improve these metrics.
  • Strong problem-solving skills, with experience in conducting post-incident reviews and driving continuous improvement initiatives.
About us

We're a global team of innovators. Together, we harness engineering excellence and passion to co-create meaningful solutions to complex challenges. We turn organizations into data-driven leaders that can make a positive impact on their industries and society. If you believe that innovation can bring a better tomorrow closer to today, this is the place for you.

Championing diversity, equity, and inclusion

Diversity, equity, and inclusion (DEI) are integral to our culture and identity. Diverse thinking, a commitment to allyship, and a culture of empowerment help us achieve powerful results. We want you to be you, with all the ideas, lived experience, and fresh perspective that brings. We support your uniqueness and encourage people from all backgrounds to apply and realize their full potential as part of our team.

How we look after you

We help take care of your today and tomorrow with industry-leading benefits, support, and services that look after your holistic health and wellbeing. We're also champions of life balance and offer flexible arrangements that work for you (role and location dependent). We're always looking for new ways of working that bring out our best, which leads to unexpected ideas. So here, you'll experience a sense of belonging, and discover autonomy, freedom, and ownership as you work alongside talented people you enjoy sharing knowledge with.

We're proud to say we're an equal opportunity employer and welcome all applicants for employment without attention to race, colour, religion, sex, sexual orientation, gender identity, national origin, veteran, age, disability status or any other protected characteristic. Should you need reasonable accommodations during the recruitment process, please let us know so that we can do our best to set you up for success.