About this role
Who we are looking for:

We're looking for a Senior Backend Engineer with a strong focus on infrastructure, platform, and reliability to be the first dedicated hire in this function. We need someone with at least 3-5 years of full-stack or backend experience, who has a proven track record of building and maintaining scalable, reliable systems in a fast-paced environment. You should be comfortable taking ownership of our entire infrastructure, from identifying risks to building out a long-term reliability roadmap, while also being able to jump in to build production code. Bonus points if you have experience at an early-stage, high-growth startup (Series A to C) and have a background as a software engineer who transitioned into infrastructure.

What you'll do:

- Take ownership of Vizcom's infrastructure, establishing baseline reliability metrics and identifying platform risks within the first 30 days.
- Design, build, and operate our distributed systems, including job queues, streaming, caching, and observability using tools like Datadog and Sentry.
- Lead the improvement of our platform's reliability by tightening incident response mechanics, building runbooks, and strengthening CI/CD and deployment flows.
- Collaborate with our AI and full-stack engineers to integrate GPU inference pipelines and ensure the backend infrastructure supports user workflows seamlessly.
- Develop and publish a comprehensive reliability roadmap with clear ownership and milestones after your first 60 days.
- Write production code and leverage your general software engineering knowledge (TypeScript, Node.js) to understand and improve the entire stack.