Job Information
Worldpay, LLC Systems Administrator Specialist in Cincinnati, Ohio
Job Description PLEASE NOTE: Sponsorship is NOT AVAILABLE for this opening. US Citizens and Green Card holders ONLY. Thank you. Are you ready to unleash your full potential? We're looking for people who are passionate about payments to chart Worldpay's path to being the largest and most-loved payments company in the world. About the team Worldpay powers 2.2 trillion payments annually across 146 countries in over 135 separate currencies with over a million-merchants supported globally. Worldpay is the largest acquirer by volume globally, we provide a reliable, secure, and scalable payments platform 24x7 365 day a year. Being part of the 200 strong Infrastructure Services organization, you'll help to engineer and deliver the core infrastructure services that power our payments platform. We're responsible for running some very critical systems, maintaining 20,000 servers via an automation platform, thousands of databases and petabytes of storage hosted from our data centres and public cloud. We are looking for talented individuals to join the Infrastructure Services organisation; you'll be a self-starter, possess an analytical mindset and be a change agent. What you will be doing Joining a team of system administrators and engineers responsible for designing, implementing and maintaining System and Cloud Observability & Log Management solutions which ensure that our infrastructure and applications are fully observable, enabling proactive monitoring, real-time analytics, and timely incident response. The team will play a critical role in developing strategies and implementing best practices in observability and log management for on-premises and cloud environments. Your responsibilities may include: Implement and manage observability tools such as Splunk, Zabbix, and similar platforms for infrastructure, applications, and cloud services. Set up and configure dashboards, alerts, and reports that provide visibility into system health, performance, and availability. Develop and maintain centralized logging solutions to ensure comprehensive logging coverage, log retention, and log security. Work with IT, DevOps, and product teams to define key performance indicators (KPIs) and service-level objectives (SLOs) for critical systems and applications. Provide support in monitoring and troubleshooting production systems, using observability tools to identify performance bottlenecks, anomalies, and incidents. Assist in automating monitoring tasks and creating self-healing scripts to enhance system reliability. Analyze logs and telemetry data to provide insights for incident detection, root cause analysis, and performance optimization. Participate in on-call rotations, responding to incidents and using observability tools for rapid diagnosis and resolution. Collaborate with security teams to ensure log management solutions support security monitoring and incident investigation. Continuously evaluate and recommend improvements to observability and log management practices, tools, and processes. What you bring: Experience: Several years of experience in IT Operations, with a focus on observability, and log management. Solid understanding of observability concepts, including metrics, log aggregation, log management, OpenTelemetry (OTEL) concepts and best practices, traces, event management and alerting. Hands-on experience with observability and monitoring tools (e.g., Splunk Enterprise, Splunk Cloud, Splunk Observability, OTEL agents, OTEL collectors, and OTEL gateways, Prometheus, Grafana, Zabbix). Strong understanding of log management best practices, including centralized logging, data retention, and privacy requirements. Familiarity with cloud platforms (e.g., AWS, Azure, GCP) and managing cloud-based monitoring solutions. Experience in designing and implem