Lead Engineer (Platform Services)
About the job
We are looking to hire a Lead Engineer to join our global team in Technology.
This is a challenging, fast-paced and exciting environment, with plenty of opportunities to influence and grow the technology area.
We are looking for a Lead Engineer to build and lead the Platform Services team to proactively ensure the stability, resilience and scale of our monitoring and alerting applications by automation, testing and engineering.
The Lead Engineer will work alongside all areas of technology to ensure that technical solutions are aligned across teams, deliver value to the customers as well as ensuring consistent monitoring, logging and alerting
The Lead Engineer is responsible for building capability and maturing operational ways of working across multiple cross-functional teams, with focus on technical excellence and a high-performance culture.
The key values that we strive towards in the Technology are:
- Continuous Improvement Mindset
Key Responsibilities :
- Line manage engineers in the team and be responsible for their career development, well-being and performance.
- Provide leadership and guidance across the Platform Services team; motivating and driving the team with technical leadership acting as a subject matter expert and leading best practice techniques.
- To lead the Platform Services team in ensuring technical assurance in significant projects, for the delivery of quality technical deliverables, which may involve several teams or technologies.
- Provide coaching and mentoring to the Platform Services team to improve their skillset, increase knowledge and set the benchmark of quality and precision engineering.
- Oversee the implementation of service transition and change and release process changes, ensuring that processes are reviewed and improved with onus on optimisation.
- Evaluate risks and defects, analysing specifications, and customising applications for specific customer needs.
- Work with technology teams to produce and maintain standards, guidelines, and pattern catalogue.
- Measuring and driving delivery and quality improvements through the capture and analysis of metrics.
- Build innovative prototypes and lead technology teams to develop quality solutions, by ensuring monitoring and alerting best practices.
- To lead and influence teams to ensure quality and operational excellence, and to ensure teams are aligned to design patterns and design collateral.
- Work with Engineering management to drive through best practice, techniques and technology both on the team and around the company.
- Foster a culture of open exchange of ideas, innovation and continuous improvements.
- Balance the commercial needs of the business against the ideal technical design, proposing sound phased or tactical implementations where appropriate.
- Understand the importance of and be a strong advocate for non-functionals eg. monitoring, alerting, logging etc.
- Define adequate strategies to deal with technical debt.
- Collaborate with the operational teams to enhance company's Incident and Problem management capabilities.
- Accountable for the quality of the implementation and deployment of the team's work.
- Accountable for the security, capacity and performance of the system.
- Act as escalation point in Incident management processes.
- Take ownership of RCA activities in your respective area.
- Learn from your experiences, adopting a continuous improvement mindset to help make better decisions.
- Proficient in agile practices.
- Demonstrable experience of leadership in complex, cross-company projects to deliver operational improvements.
- Strong engineering background acquired during a previous hands-on development role.
- Managerial and leadership skills, able to motivate and lead personal development plans for employees.
- Driving change and handling difficult situations.
- Dealing with change on a daily basis.
- Ability to demonstrate the value of changes introduced.
- Proven communication skills.
List of skills we think you need:
- Monitoring and Observability - ELK, Grafana, Prometheus, Prometheus, PRTG
- CI/CD - gitlab, jenkins, ansible, terraform
- Scripting - bash, bat, python, perl
- Testing - testrails, k6, gatling
- Traffic Management - F5, haproxy, keepalived
- Web servers - apache, nginx, IIS, Drupal
List of skills we'd love you to have:
- OS Administration - Windows, Linux
- Virtualisation and Orchestration - vmware, docker, kubernetes
- Databases - Oracle, mysql, mssql
- CDNs - CDNetworks, Akamai, Incapsula
- Networking - Cisco, Brocade
- Programming Languages - PHP, .NET, Java, JS, Ruby, Flutter
- Caching - redis, memcached, varnish
- Cloud - Google Cloud, Oracle Cloud
- APIs - REST, JSON, SOAP
- Security - trellix, fireeye, OKTA, cisco anyconnect, entrust