TruU is looking for an experienced Site Reliability Engineer to own the development and operations of a large-scale, cloud-centric application. This individual would be responsible for developing and deploying new capabilities in our AWS cloud, as well as own and solidifying the existing infrastructure.

In this role, you will put to use a diverse set of technical expertise; including DevOps tooling, troubleshooting and debug skills, SOC2 compliance, cloud computing, coding/scripting, big-data management, and infrastructure engineering. Beyond your technical qualification, you will also have a healthy combination of cross-group collaboration skills, communication and relationship building skills.

Responsibilities

  • Help create our platform for observability, monitoring and tracing
  • Metrics for capturing performance monitoring
  • Help with application tracing for diagnosis, post mortem analysis
  • Help deploy a secure and reliable system in production that can scale
  • Documenting and conducting post incident reviews

Skills and Experience

  • Must have experience at least 5 years of working on Cloud Platforms. Aws is preferred (we use AWS)
  • At least 5 years of experience scripting and automation experience (Bash, Shell, Powershell, JavaScript, etc)
  • Strong knowledge/experience with microservices and use / orchestration of containers (e.g. Docker, ECS, ECR, Kubernetes, Fargate)
  • Logging and Monitoring Telemetry (e.g. Splunk, CloudWatch, etc.)
  • Experience in software Project life cycle activities designing, supporting and deploying systems comprising one or more of the following: Java (Spring), Kafka, Elastic Search, PostGres, Redis, Python (Django), Angular2+
  • Perform deployments, patches, upgrades, configurations in a controlled, pre-production, and production environment with strict operating parameters
  • CI/CD Pipeline experience (TeamCity preferred, but Jenkins is acceptable)
  • Focused experience on the Ops side vs. Dev side
  • Experience with ULM tools like datadog, sumologic
  • Nice to have experience with performance monitoring tools like New Relic, etc.

To be successful in this role, you must have good communication skills both verbal and written, enjoy continual learning and constant improvement and comfortable being on-call in the event of a system outage. You must be a highly motivated individual who is experienced enough to plan ahead and document your work amidst a fast-paced environment rooted in security.

About TruU

TruU is a cyber security company that is transforming the way users are identified in order to provide digital and physical access as a frictionless

experience. We are solving the problem of giving users trusted access to digital and physical sites without the need for passwords. We are a

startup environment that is focused on empowering the team to architect and implement elegant solutions to the interesting and challenging

problems before us, in an enjoyable environment that fosters our own professional and personal growth.