Senior Platform Engineer
ExaCare
Company Overview
We are a trailblazing health tech company on a mission to revolutionize the nursing home & post-acute space. Our innovative AI software is transforming the admissions process and care delivery in these settings. We’ve raised $10.35M to date and are experiencing rapid growth. We are looking for a Senior Platform Engineer to join our growing team.
About the Role
Organizations switch to ExaCare because their admission processes are broken. They rely on multiple outdated platforms and error-prone manual work to admit residents. This fragmentation wastes time and leads to suboptimal care and business outcomes. We are seeking a high‑ownership platform engineer to design, build, and operate our AWS serverless platform—owning the infrastructure, CI/CD, observability, and guardrails that let teams ship quickly, safely, and with confidence.
What You’ll Do
- Own our entire platform. Build and operate AWS Lambda, Step Functions, SQS/SNS, and event-driven workflows.
- Manage data runtime. Operate Aurora PostgreSQL, schema migrations, performance tuning, and connection management inside VPC.
- Level up CI/CD. Evolve GitLab CI pipelines, caching, test sharding, and deployment stages with safe rollouts and automated DB migrations.
- Strengthen security. Implement least-privilege IAM, secrets via SSM/Secrets Manager, environment isolation, and secure VPC networking.
- Improve observability. Standardize logging/metrics/tracing with CloudWatch and Lumigo; define SLOs, alerts, and on-call runbooks.
- Optimize costs and performance. Tune concurrency, batch sizes, and runtimes; right-size Aurora and Redis usage; track and reduce spend.
- Build enabling tooling. Create golden paths, CLIs, and internal automations that remove toil and speed developer velocity.
- Lead operational excellence. Participate in on-call, drive postmortems, and champion reliability best practices.
- Pitch in wherever the startup needs it: from shipping user‑facing features and creating APIs to writing React UI when it helps the team move faster.
What You’ll Bring
- Bachelor’s degree in computer science or similar technical field
- Self-starter who turns high-level goals into execution plans
- 7+ years in platform/SRE/infrastructure roles building scalable distributed systems
- Hands-on AWS serverless experience (Lambda, API Gateway, Step Functions, SQS/SNS, S3)
- Strong with Aurora/PostgreSQL, RDS Proxy, and SQL performance fundamentals
- Proven CI experience (pipelines, runners, Docker-in-Docker, artifacts/caching)
- Proficiency with the Serverless Framework, CloudFormation, and IaC best practices
- Solid observability skills (CloudWatch, tracing/metrics/logs, SLOs) and incident response
- Familiarity with Redis (ElastiCache), OpenSearch, and VPC networking
- Comfortable automating with TypeScript/Node, Python, and Bash
- Bonus: experience running LLM workloads (Modal.com/Bedrock)
Benefits and Perks
- Competitive salary and equity in a high-growth startup
- Paid time off at your discretion
- Optional to be fully remote or work hybrid if based in Toronto, Vancouver, or NYC
- Medical, dental and vision coverage
- Access to AI tooling: Cursor, Claude Code, Codex
- Great start-up culture