Jump to main content

Transport options

Public Transport
Back to results

Site Reliability Engineer

The Core, Bristol, BS1 6JX
Full time
The role

We’re actively looking for people to join our teams and we’re committed in protecting your health and wellbeing during every step of our recruitment process.

If you’re successful in securing a role with us you’ll find we’ll be doing things a little bit differently and it may mean you’ll initially join our team by working from home.

During this time we’ll work with you to make sure you have the tools and equipment you need and that you feel part of our amazing DLG team!

What we're looking for:

We are building our Site Reliability Engineering capability and are looking for talented and hardworking individuals to join our team. As a Site Reliability Engineer you will typically spend up to 50% of your time on operations work: ensuring the daily health and performance of our software services. The remainder of your time will be spent on development activities, from building new business functionality to automating routine tasks and implementing monitoring systems.

You should have a keen interest in either an Application or Infrastructure support role with experience in one or more of the following:

  • Programming languages: JavaScript/TypeScript, Python, Java, Shell scripting
  • Configuration management: AWS CDK, CloudFormation, Ansible
  • Database: SQL skills
  • Knowledge of cloud-based technologies including AWS and Docker.
  • Monitoring tools, including: AppDynamics, CloudWatch, PagerDuty, ELK Stack

Are you looking to broaden your skillset and capabilities in a rapidly growing environment?

Analytical and problem-solving skills, with the ability to come up with practical solutions for production systems in a time critical environment, are also key!

As this is a new capability we will work to help talented individuals gain key the skills of a Site Reliability Engineer. You will build a deep understanding of the application, the code, how it runs, how it is configured and how it scales. This knowledge will make you invaluable at monitoring and supporting the applications as a Site Reliability Engineer!

Who you'll be working with:

Application Development and Maintenance perform a number of functions within Technology Services, we will help develop your technical capabilities to enable your expertise in supporting our programs, from implementing new business functionality to improving the many DLG business and brands.

What you'll be doing:

  • Performing Application Development and Maintenance activities to ensure high availability and reliability of our systems and data
  • Actively monitoring and reviewing application performance
  • Handling on-call and emergency support, ensuring software has robust logging and diagnostics
  • Working with the Platform Engineering capability to enable rapid continuous integration and deployment of application change
  • Providing oversight and governance of all changes across the environment.
  • Providing subject matter expertise input into technology, strategy, commercial and solution design activities
  • Building and maintaining operational runbooks
  • Working on feature requests, defects and other development tasks
  • Tracking industry and market developments in relation to DLG applications and emerging needs
  • Contributing to the overall product roadmap

What we'll give you:

We're always encouraging internal development and you'll have access to loads of learning opportunities, events and conferences to build your industry knowledge. DLG reached number 35 on Glassdoor's top places to work in 2018, because of our working culture, dedication and passion for developing our colleagues; a place where you can truly build a long-lasting career.