Job Details

Senior Analyst, Site Reliability Engineer


Date Opened: 07/04/2022

Job Type:

Job Number: 220003FM

Job Description

Who we are:


As North America’s oldest startup and Canada’s purpose-driven digital marketplace, The Bay is on a high-growth mission to rewrite the rules of retail to help Canadians live a colorful life. If you believe in the power of our iconic brand and thrive on problem-solving at scale, we want you to join our team.

At The Bay, smart, high-performing team members will challenge you to learn and grow every day. We value ambitious work and great ideas grounded in data and insights. We're looking for talented people who love a fast-paced environment, embrace change, and are looking to make an impact with groundbreaking ideas.

We are building a digital-first company and brand for a diverse world and we need an inclusive team to reach our potential. We strongly encourage applications from everyone to come and join a winning team that supports diverse thinking and demonstrates innovation, energy, creativity, and vision every day.

You can learn more and view available positions in Bengaluru, by visiting

What This Position is All About
The Site Operations Engineer is primarily responsible for ensuring the optimal operational health, stability, and
performance of the The Bay’s e-commerce platform, CRM platform, Mobile applications. and related technologies.
They are responsible for monitoring system health, addressing incidents, and assist with problem investigations. As
site administrators, the Site Operations Engineer will support Salesforce Cloud applications in terms of configuration
management, capacity planning, tuning, and optimization. As members of the Site Reliability Engineering team, the
Site Operations Engineer plays a critical role in providing ongoing feedback to improve site performance, availability,
and resiliency, and is therefore, expected to work closely with Development teams, Product Management, Business
users, and other Site Reliability Engineers. This position reports directly to the Manager, Site Reliability Engineering.

Bachelor’s Degree in Computer Science or equivalent
? 5 plus years of SRE experience working on telemetry, observation, self-healing solutions, and platform
? AWS, Azure, Microsoft, Salesforce CC and/or Salesforce SC, certifications, and knowledge of ITIL practices
? Strong troubleshooting, analytical, and problem-solving skills
? Experience in the administration and support of Digital Retail Platforms, e.g. Salesforce CC, Shopify,
Magento, IBM WebSphere Commerce, etc.
? Experience with monitoring, logging & telemetry tools eg: New Relic, Splunk, ELK, Nagios, SolarWinds,
Prometheus, AWS Cloudwatch, Datadog, etc.
? Basic understanding of Networking, Content Delivery Networks (CDN, e.g. Akamai, Cloudflare), and Cloud
? Hand-on experience in the monitoring of streaming platform technologies, eg Apache Kafka.
? Experience with automation and tools such as (but not limited to) Jenkins, Chef, Terraform, Ansible, etc.
? Experience in creating and maintaining Automation (PowerShell, Python, Ruby, AWK, SED, etc.) to run
health-checks and self-healing capabilities for the platforms.
? Strong verbal and written communication skills.
? Networking fundamentals: TCP/IP, DNS, WINS, DHCP, etc.
? Collaboration & Change Management tools: Jira, ServiceNow, Cherwell, etc.
As the Site Operation Engineer, you will:

? Monitor systems and telemetry of the The Bay’s Digital Platform, including Salesforce Commerce Cloud, Service
Cloud, and Mobile platforms for operational health in terms of site availability, reliability, capacity, and performance.
? Prioritize and develop automated administrative and operational tasks to continuously improve site stability,
capacity, and reliability.
? Provide active incident response support, investigate major problems, and ensure the timely and effective return to
normal operations of The Bay’s Digital, CRM, and Mobile platforms during major incidents.
? Provide periodic on-call support based on established 24/7/365 support schedules.
? Collaborate with DevOps, Digital Development, and QA teams to ensure that Production environments are
deployment ready in accordance with The Bay’s Change Management processes and the Digital release schedule.
• Provide L2 and L3 operational support to business partners, developers, and cross-functional technology teams as needed 

Job Qualifications

Thank you for your interest with HBC. We look forward to reviewing your application.


HBC provides equal employment opportunities (EEO) to all employees and applicants for employment.