Discogs Inc: Senior Site Reliability Engineer - Data (REMOTE)
The Discogs Platform team is focused on several objectives: building and supporting performant, cost-effective, reliable infrastructure; developer experience tooling and mentorship; and creating "golden paths" for organization-wide standards and velocity. As a key member of the Platform team, the Senior Site Reliability Engineer - Data will be working closely with other Discogs engineering squads to develop and optimize scalable, well-planned relational database architectures, drive best practices and stability for our use of Kafka and change data capture, and contribute to the Platform team’s operations.LocationThis is a remote position. Open to candidates located in OR, WA, CA, CO, TX, ILCompensationStarting Base Salary Range: - yearlyWhat You’ll AccomplishReasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.Stewarding Discogs’ data stores as a key subject matter expertLeading efforts on the reliability and design patterns of our Kafka and Kafka Connect implementationsEstablishing data contracts and clear communication standards between CDC producers and consumersWorking closely with engineering squads to refactor and re-architect MySQL database schema and indexing for long-term scalability, performance, and cost effectivenessMentoring engineering squads on Platform best practices for MySQL, Kafka, and other software development lifecycle areas Writing documentation and runbooks that contribute to the engineering organization’s knowledge baseWorking in a containerized, orchestrated environmentContributing to the Platform team’s disciplines of site reliability and operations, supporting both our squads and Platform’s central infrastructureParticipating in on-call rotation, responding to incidents, and troubleshooting data and other operations issuesWhat You’ll ContributeMinimum Education and ExperienceA Bachelor's Degree in Computer Science or similar area of focus, or equivalent relevant work experience.5+ years of experience working with Kafka and relational database management systems.6+ years experience in Ops, DevOps, Site Reliability, Platform or other systems roles.Required Skills & Abilities:Relational database schema design, query performance optimization, administrationKafka: Cluster administration, Kafka ConnectCI/CDGitOpsKubernetesAWS and cloud developmentObservabilityScriptingTrack record of collaboration and mentorshipExcellent written communication and documentation skillsContinuous learningOwnership and proactive approach to solving large problemsPreferred:Infrastructure-as-codeElasticsearchPythonGraphQLREST APIHashicorp VaultRedisMemcachedNoSQL DatabaseData Lake/WarehouseData GovernanceData SecurityThe Platform team covers a wide range of technical topics and we'd love to hear about your skills beyond this list!Apply NowLet's start your dream job Apply now Meet JobCopilot: Your Personal AI Job HunterAutomatically Apply to Remote DevOps and Sysadmin JobsJust set your preferences and Job Copilot will do the rest-finding, filtering, and applying while you focus on what matters. Activate JobCopilot
#discogs #inc #senior #site #reliability
Discogs Inc: Senior Site Reliability Engineer - Data (REMOTE)
The Discogs Platform team is focused on several objectives: building and supporting performant, cost-effective, reliable infrastructure; developer experience tooling and mentorship; and creating "golden paths" for organization-wide standards and velocity. As a key member of the Platform team, the Senior Site Reliability Engineer - Data will be working closely with other Discogs engineering squads to develop and optimize scalable, well-planned relational database architectures, drive best practices and stability for our use of Kafka and change data capture, and contribute to the Platform team’s operations.LocationThis is a remote position. Open to candidates located in OR, WA, CA, CO, TX, ILCompensationStarting Base Salary Range: - yearlyWhat You’ll AccomplishReasonable accommodations may be made to enable individuals with disabilities to perform the essential functions.Stewarding Discogs’ data stores as a key subject matter expertLeading efforts on the reliability and design patterns of our Kafka and Kafka Connect implementationsEstablishing data contracts and clear communication standards between CDC producers and consumersWorking closely with engineering squads to refactor and re-architect MySQL database schema and indexing for long-term scalability, performance, and cost effectivenessMentoring engineering squads on Platform best practices for MySQL, Kafka, and other software development lifecycle areas Writing documentation and runbooks that contribute to the engineering organization’s knowledge baseWorking in a containerized, orchestrated environmentContributing to the Platform team’s disciplines of site reliability and operations, supporting both our squads and Platform’s central infrastructureParticipating in on-call rotation, responding to incidents, and troubleshooting data and other operations issuesWhat You’ll ContributeMinimum Education and ExperienceA Bachelor's Degree in Computer Science or similar area of focus, or equivalent relevant work experience.5+ years of experience working with Kafka and relational database management systems.6+ years experience in Ops, DevOps, Site Reliability, Platform or other systems roles.Required Skills & Abilities:Relational database schema design, query performance optimization, administrationKafka: Cluster administration, Kafka ConnectCI/CDGitOpsKubernetesAWS and cloud developmentObservabilityScriptingTrack record of collaboration and mentorshipExcellent written communication and documentation skillsContinuous learningOwnership and proactive approach to solving large problemsPreferred:Infrastructure-as-codeElasticsearchPythonGraphQLREST APIHashicorp VaultRedisMemcachedNoSQL DatabaseData Lake/WarehouseData GovernanceData SecurityThe Platform team covers a wide range of technical topics and we'd love to hear about your skills beyond this list!Apply NowLet's start your dream job Apply now Meet JobCopilot: Your Personal AI Job HunterAutomatically Apply to Remote DevOps and Sysadmin JobsJust set your preferences and Job Copilot will do the rest-finding, filtering, and applying while you focus on what matters. Activate JobCopilot
#discogs #inc #senior #site #reliability
·113 Views