At BPTN – Black Professionals in Tech Network we’re pushing the future of tech forward by creating a space for Black professionals in tech to gather, grow and evolve – all while being a conduit for companies to engage this talent across North America.
We’re here to help Black professionals network, connect with one another, share resources and grow their careers. Our rapidly growing network counts over 50,000 Black professionals. We provide our members with access to mentorship, skill-building opportunities, and a strong peer network to support professional growth and advancement.
Our client is looking for a Site Reliability Expert II to join their global Hospitality SRE team. You’ll be focussing on the strategic expansion and scalability improvements of our flagship product – while being part of the global team to keep the lights on on all platforms.
Responsibilities
- Initiate and contribute to continuous improvement of our software delivery processes and practices in a multi-location, multidisciplinary team to empower and accelerate product development
- Use automation extensively to design, configure, manage, and monitor systems in support of our product development teams
- Design and architect operational solutions with the specific goal of increasing the standardization, automation, repeatability, cost-efficiency and consistency of operational tasks
- Work with developers and other SRE to design and build scalable and reliable Cloud cost efficient infrastructure
- Write and maintain architectural, stakeholder, policy and processes documentation
- Adhere to and advocate for best practices, including Infrastructure as Code, monitoring, high availability, disaster recovery, security, and DevOps methodologies
- Collaborate with development teams and use intuition, experience and understanding to create SLIs, SLOs, and SLAs
- Provide timely assistance and remediation solutions during critical situations and production incidents to help resolve service problems (You will be on call for periods of time)
Qualifications
- Strong customer-focused mindset
- Quality and reusability oriented
- Good knowledge of Amazon Web Services and/or Google Cloud Platform
- Good understanding of Agile development and continuous delivery best practices, software engineering tools, processes, methods and testing
- Ability to partner effectively with other teams
- Ability to plan, organize, prioritize and stay focused
- Strong experience with Docker, Kubernetes, Linux Systems, Config Management
- Strong experience with datastores: MySQL, ElasticSearch, Kafka
- Strong “Automate All The Things” mindset
- Experience with Infrastructure as code practices, we use Terraform
- Understanding of Secret management with Vault or similar systems
- Good experience provisioning and managing infrastructures with high availability constraints
Locations
- Amsterdam, Netherlands
- Berlin, Germany
- Ghent, Belgium
- Hamburg, Germany
- London, United Kingdom
- Montreal, QC, Canada
- Ottawa, ON, Canada
- Toronto, Ontario, Canada
- Paris, France
- Providence, Rhode Island, United States