SaFi Bank Space : Observability Tech Sync-up



Topics

General Strategy

Please refer to Observability

RoadMap

Collection

  • Logs

    1. common logger enhancement

    2. SAF-166 - Getting issue details... STATUS

  • Traces

    1. Temporal Tracing Support

    2. SAF-147 - Getting issue details... STATUS

  • Metrics

    1. OpenTelemetry vs Micrometer in Micro-services

Monitoring

  1. Micro-service Application

    1. Dashboard → https://grafana.monitoring.brave.safibank.online/d/backend_system_dashboard/backend-system-dashboard?orgId=1

    2. Alert → Not start yet

  2. ThoughtMachine → Observability Stack (In-progress)

  3. Database

    1. need research on this

  4. Infra

    1. Had default dashboard & alerts

  5. Temporal

    1. Create ticket for setup dashboards & alerts

  6. Confluent Kafka

    1. Create ticket for doing the research

  7. Cloud & Network Resources

Actions

  1. Yuetong Yang (Unlicensed) will create Jira ticket for setting up alert rules for micro-services based on current dashboard

  2. BharathKumar D will help to create Jira ticket for setting up monitoring & alerts for database once AlloyDB migration is done

  3. Lucky La Torre (Unlicensed) will help to create Jira ticket for setting up dasboard & alerts for Temporal based on https://safibank.atlassian.net/wiki/spaces/ITArch/pages/124878863/Temporal#Monitoring

  4. BharathKumar D will help to create Jira ticket for researching on Confluent Kafka for how we could monitor Confluent Kafka usage and integrate with our observability stack.