JOB DESCRIPTION

Reporting to the Engineering Lead – Service Availability, the position holder will be responsible for monitoring and observability, and for improving the operational aspects of all systems in scope within DIT. The role drives automation and DevOps across the different domains and fosters service monitoring through proactive initiatives such as AIOps and machine learning, among other available channels.

RESPONSIBILITIES

  • Proactively building and implementing monitoring services, including end-to-end monitoring, scripting and automation, modern tooling and maintenance software.
  • Using AI and machine learning to perform log analysis and create predictive models that help identify potential failures.
  • Developing and executing automation scripts and maintenance jobs. 
  • Developing automation around monitoring.
  • Onboarding DIT systems to the service monitoring tools (APM tools such as ELK).
  • Clearly documenting any monitoring gaps noted and collaborating with the relevant teams to ensure timely closure.
  • Performing application error analysis and follow-up to ensure an optimal customer experience.
  • Deploying planned and operational changes on systems in scope.
  • Supporting all Digital squads to ensure new products are monitored.
  • Supporting Zero Touch Operations initiatives.
  • Supporting the development of collectors and agents.

QUALIFICATIONS

  • Bachelor’s degree in Computer Science, Information Technology, Electrical and Communication Engineering, Business Information Systems, or a relevant field in telecommunications.
  • Domain knowledge in at least two of the following areas: system administration (especially Linux), orchestration (Kubernetes), the Linux kernel, OpenTelemetry.
  • Good understanding of back-end programming languages such as Python and Rust.
  • Technical understanding of SRE concepts and DevOps practices with respect to providing stable services to customers, adhering to availability KPIs, Service Level Objectives and Service Level Indicators, and conforming to the target monthly error budget.
  • Well versed in one or more modern monitoring tools such as ELK, Prometheus, Dynatrace, AppDynamics, New Relic or Splunk.
  • Good understanding of microservice architecture and an appreciation of traditional/classic SOA.
  • Ability to manage a team, with leadership skills, ownership of issues, and an analytical, problem-solving approach.
  • Ability to implement a strict change management policy.
  • Conversant with agile ways of working.

 

How to Apply
If you feel that you are up to the challenge and possess the necessary qualifications and experience, kindly update your candidate profile on the recruitment portal and then click on the Apply button. Remember to attach your resume.
 

 

ABOUT US

We are the leading telecommunications company in East Africa. Our purpose is to transform lives by connecting people to people, people to opportunities and people to information. We keep over 42 million customers connected and play a critical role in society, supporting over one million jobs both directly and indirectly, while our total economic value was estimated at KES 362 billion ($3.2 billion) for the 12 months through March 2021. We are listed on the Nairobi Securities Exchange (NSE), with annual revenues of close to KES 298 billion ($2.5 billion) as at March 2022. We were founded in 1997 as a fully owned subsidiary of Telkom Kenya, before a 40 percent acquisition by Vodafone Group PLC in May 2000 and a public offering of 25 percent of shares through the NSE in 2008. Under the management of Vodafone Group PLC, we welcomed Michael Joseph as our first CEO a few months later, in July 2000. He led the company’s growth from 20,000 subscribers to 16.71 million, largely owing to innovative products like M-PESA, launched in 2007.