DevOps

Supercomputer Dashboard

Django dashboard for monitoring SahasraT supercomputer at IISc - India's fastest supercomputer with

DjangoPythonLinuxHPCPostgreSQLRedisSLURM

Project links not publicly available

Supercomputer Dashboard

About the Project

Supercomputer Dashboard

A comprehensive monitoring and management dashboard for SahasraT, the flagship supercomputer at the Supercomputer Education and Research Centre (SERC), Indian Institute of Science (IISc).

About SahasraT

SahasraT was India's fastest supercomputer during my tenure, featuring:

  • 33,000+ CPU cores across multiple node types
  • Petascale storage with Lustre parallel filesystem
  • InfiniBand interconnect for high-speed node communication
  • Serving 500+ researchers across India

Dashboard Features

Real-time Monitoring

  • Node health status with color-coded visualization
  • CPU, memory, and GPU utilization metrics
  • Storage capacity and I/O performance
  • Network traffic and InfiniBand statistics

Job Management

  • SLURM job queue visualization
  • Historical job analytics and reporting
  • Resource allocation trends
  • User quota management

Alerting System

  • Configurable threshold-based alerts
  • Email and SMS notifications
  • Integration with Nagios for infrastructure alerts
  • Incident tracking and resolution workflow

Technical Implementation

  • Backend: Django 3.x with Django REST Framework
  • Frontend: Bootstrap 4 with Chart.js for visualizations
  • Data Collection: Custom Python agents using SLURM and IPMI APIs
  • Database: PostgreSQL with TimescaleDB for time-series metrics
  • Caching: Redis for real-time metric caching
Project Details

Year

2021

Category

DevOps

Want to see more of my work?

Back to All Projects