Staff Engineer — Platform & Enterprise Readiness

San Francisco · On-site · $160k – $250k + Equity

About this role

Cosmon is looking for a Staff Engineer to own the Nexus desktop platform and enterprise deployment experience that every integration and customer deployment relies on. This role sits on the platform team solving native Windows, COM, update, and deployment problems that directly determine onboarding speed, reliability, and whether POCs convert to customers. Your work will shape product reliability across Fortune 100 environments.

What you'll do

  • Own the Nexus desktop app framework, set architecture, and drive runtime decisions.
  • Build COM and native interop infrastructure including client helpers, lifecycle management, threading, and marshaling.
  • Design process reliability features such as sandboxing, watchdog supervision, crash isolation, and observability.
  • Define and implement plugin and extension architecture for versioning, loading, isolation, and independent updates.
  • Deliver enterprise deployment packages that handle Defender, firewall rules, signing, MSI packaging, Group Policy, SCCM, and Intune edge cases.
  • Own update and rollback systems, including silent updates, staged rollouts, and air-gapped compatibility.
  • Partner directly with customer IT/security teams during POCs to resolve installer, Defender, and registration issues.
  • Drive onboarding time-to-value from download to first successful workflow and remove friction in first-run experiences.
  • Set and hit MTTR targets for client-reported issues and build triage and debugging infrastructure.
  • Track and reduce production bug count through severity-weighted metrics and ownership of reliability trends.
  • Build platform observability (logs, traces, telemetry) that works in locked-down enterprise environments.

What Cosmon is looking for

  • 8+ years building production software, with at least 3 years in a staff or senior IC role owning system-level outcomes.
  • Deep experience shipping desktop applications into enterprise Windows environments, including MSI, signing, Defender, Group Policy, SCCM/Intune, and firewall issues.
  • Strong expertise in COM (threading models, marshaling, out-of-process hosting, debugging HRESULTs).
  • Strong C/C++ skills with working knowledge of Python or TypeScript for tooling and higher-level components.
  • Experience in process-level reliability engineering (watchdogs, crash recovery, IPC, sandboxing).
  • Demonstrated production debugging skills with tools like WinDbg, Event Viewer, and MSI verbose logs.
  • Experience with Windows security tooling (Defender, SmartScreen, EDR, certificate management) and enterprise update systems (Squirrel, MSIX, custom updaters).
  • Experience operating in air-gapped or heavily network-restricted environments and building plugin/extension architectures.
  • Exposure to engineering or scientific desktop software (CAD/CAE/PLM) is a plus.

Company at a glance

Cosmon is reimagining computer-aided engineering (CAE) for the AI era by building AI that thinks like an engineer. It operates offices in San Francisco and Palo Alto and is hiring AI/ML engineers.

Industry: AI/ML
Team Size: 11-50
Workspace: On-site
Founded: 2025
Locations: 531 Howard St G/F, San Francisco, CA 94105, USA · San Francisco, CA, USA
Website: cosmon.com