Skip to content

IIMS – Implemented Scope & Planned Roadmap Overview

Information Infrastructure Management System (IIMS)


Purpose

This document provides a concise executive and technical overview of:

  • What IIMS Implemented delivers today
  • What is intentionally planned for Planned and future evolution

It serves as:

  • A scope reference for engineering teams
  • A roadmap and planning guide for management
  • A positioning document for stakeholders and architects

This document complements the detailed architecture and workflow documents and acts as the single “what is done vs what is planned” reference.


1. Implemented Strategic Positioning

1.1 What Implemented Is

Implemented establishes IIMS as a unified operational coordination and visibility platform.

The primary goals of Implemented are:

  • Unify inventory, monitoring, incidents, maintenance, and tickets into one model
  • Provide a single operational UI for NOC and operations teams
  • Eliminate fragmented visibility across monitoring and ticketing systems
  • Build a strong provider‑independent foundation for future automation

Implemented focuses on:

  • Stability
  • Correctness
  • Provider abstraction
  • Manual and operator‑driven workflows

Implemented intentionally avoids heavy automation and intelligence in order to:

  • Reduce risk
  • Validate the domain model
  • Establish trust with operators

1.2 What Implemented Is Not

Implemented is not yet an autonomous or self‑healing operations platform.

In Implemented:

  • Correlation is rule‑light and mostly operator‑driven
  • Topology is used for visualization, not automatic impact propagation
  • Root cause analysis is manual
  • SLA enforcement and escalation are limited
  • Routing and failover are policy‑driven, not adaptive

These advanced capabilities are intentionally deferred to Planned.


2. Implemented Delivered Capabilities (Current Scope)

2.1 Unified Infrastructure Model

Implemented delivers a complete infrastructure foundation:

  • Hierarchical Site model (regions, data centers, zones)
  • Asset inventory with types, tags, enable/disable, provisioning state
  • Geo‑location with inheritance (asset inherits site location)
  • Roll‑up alert and severity summaries by site

This provides a consistent structural model for all operational workflows.


2.2 Monitoring Integration (Zabbix)

Implemented delivers stable monitoring integration:

  • Monitoring Targets abstraction
  • Zabbix adapter implementation
  • Asset provisioning and template binding
  • Alert ingestion and normalization
  • Interface discovery and telemetry queries

Routing features:

  • Site‑aware target resolution
  • Collector routing using constraints (site, tags, asset type, capacity)
  • Deterministic and policy‑driven routing

2.3 Alert Management

Implemented delivers:

  • Alert ingestion and normalization
  • Alert history and alert cache
  • Acknowledgement and close actions
  • Maintenance‑aware suppression

Limitations (by design):

  • Correlation is rule‑light
  • No automatic incident creation engine

2.4 Incident Management

Implemented delivers a stable human workflow model:

  • Manual and rule‑light incident creation

  • Incident lifecycle:

  • New

  • Investigating / Open
  • Resolved
  • Closed

  • Alert ↔ Incident linking

  • Activity timeline for all incident actions

Limitations:

  • Ownership and assignment are basic
  • No SLA timers or escalation policies

2.5 Ticketing Integration (Zammad)

Implemented delivers:

  • Ticket creation from incidents
  • Ticket reference storage in incidents
  • Limited periodic status synchronization

Safety controls:

  • Ticket creation blocked during maintenance

Limitations:

  • No real‑time bi‑directional synchronization
  • Single ticket per incident (practical limit)

2.6 Maintenance and Operational Control

Implemented delivers first‑class maintenance handling:

  • Maintenance windows by site, asset, tags, or scope
  • Alert suppression during maintenance
  • Incident and ticket creation blocked during maintenance
  • Full audit trail of maintenance actions

This provides reliable noise suppression and operational safety.


2.7 Topology and Geo Visualization

Implemented delivers:

  • Topology Links with LineString geometry
  • Manual status binding and policy‑based link status
  • GeoMap with clusters, assets, and links
  • Bounding‑box queries and cached map summaries

Behavior:

  • Visualization and situational awareness
  • Manual impact assessment by operators

Limitations:

  • No automatic impact propagation
  • No blast‑radius or root cause engine

2.8 Caching, Dashboards, and Performance

Implemented delivers:

  • Alert cache and map cache
  • Summary counters for dashboards
  • Fast read‑only UI backed by cached views
  • Background refresh workers

This enables scalable UI performance without overloading providers.


3. Planned Strategic Vision

Planned builds on the Implemented foundation and transforms IIMS from a coordination platform into an intelligent operations automation and decision system.

The strategic goals of Planned are:

  • Reduce manual operator work
  • Automatically correlate signals into problems
  • Infer root causes and blast radius
  • Enforce SLAs and escalation policies
  • Provide real‑time, proactive operations intelligence

4. Planned Roadmap by Capability Area

4.1 Automation & Correlation

Planned Planned capabilities:

  • Automatic alert correlation rules
  • Automatic incident creation and merging
  • Deduplication and noise scoring
  • Time‑based and topology‑based correlation

Business value:

  • Faster incident detection
  • Reduced alert noise
  • Lower operator workload

4.2 Topology Impact & Root Cause Analysis

Planned Planned capabilities:

  • Automatic dependency traversal
  • Blast‑radius computation
  • Root cause candidate identification
  • Service‑level impact modeling

Business value:

  • Faster troubleshooting
  • Clear understanding of business impact
  • Reduced mean‑time‑to‑repair (MTTR)

4.3 SLA, Escalation, and Ownership

Planned Planned capabilities:

  • Incident ownership and assignment model
  • SLA timers and breach detection
  • Escalation policies and on‑call routing
  • Priority‑driven workflows

Business value:

  • Enforced operational discipline
  • Predictable response times
  • Management‑level reporting and accountability

4.4 Multi‑Provider & Intelligent Routing

Planned Planned capabilities:

  • Active multi‑provider monitoring (Zabbix + Prometheus)
  • Provider health scoring
  • Automatic target and collector failover
  • Capacity‑aware and latency‑aware routing

Business value:

  • Higher availability
  • Easier platform migration
  • Reduced vendor lock‑in risk

4.5 Ticketing & Workflow Automation

Planned Planned capabilities:

  • Real‑time bi‑directional ticket synchronization
  • Multi‑ticket per incident
  • Automatic ticket state → incident state mapping
  • Workflow templates and playbooks

Business value:

  • Reduced manual ticket handling
  • Better alignment between NOC and ITSM
  • Faster resolution workflows

4.6 Real‑Time Dashboards & Collaboration

Planned Planned capabilities:

  • Streaming dashboards and push updates (WebSockets)
  • Live topology and map updates
  • Collaborative incident workspaces
  • Post‑mortem and reporting tools

Business value:

  • Real‑time situational awareness
  • Improved collaboration during major incidents
  • Strong operational reporting and analytics

5. Architectural Continuity

A key design goal of IIMS is that Planned builds directly on Implemented without breaking existing workflows.

Implemented already provides:

  • Provider‑agnostic adapters
  • Routing abstraction (targets + collectors)
  • Domain‑driven core model
  • Cached read models
  • Policy engines for links and routing

These foundations allow Planned automation to be added incrementally:

  • Without changing the UI contract
  • Without rewriting domain models
  • Without disrupting operations

6. Summary

Implemented establishes IIMS as a unified operational coordination platform with:

  • Strong inventory and topology foundation
  • Stable Zabbix and Zammad integrations
  • Reliable alert suppression and incident handling
  • Fast, cached dashboards and geo visualization

Planned evolves IIMS into an intelligent operations automation platform by adding:

  • Automated correlation and RCA
  • Topology‑driven impact analysis
  • SLA enforcement and escalation
  • Multi‑provider resilience
  • Real‑time collaboration and analytics

Together, these phases position IIMS as a long‑term strategic operations platform capable of growing from unified visibility into advanced autonomous operations.