Skip to content

Jalapeno Infrastructure

Jalapeno has the following components that create its infrastructure: Kafka, ArangoDB, InfluxDB, Grafana, and Telegraf.

All infrastructure components reside in the Jalapeno Kubernetes cluster and are deployed using the /install/infra/deploy_infrastructure.sh script.

Each Jalapeno infrastructure component defines its deployment in the respective YAML files. These files allow for the configuration of services, deployments, config-maps, node-ports and other Kubernetes components.

Kafka

Kafka is Jalapeno's message bus and core data handler.

Jalapeno's Kafka instance handles two main types of data: BMP data (topology information) supplied by the GoBMP collector, and telemetry data (network metrics) supplied by Telegraf. Jalapeno Processors are responsible for reading and restructuring the data, and for inferring relevant metrics from the data.

BMP data is organized into Kakfa topics such as gobmp.parsed.peer and gobmp.parsed.ls_node. These topics are further parsed to create representations of the network topology using the Topology Processor.

Telemetry data is collected in the jalapeno.telemetry topic. Data in this topic is pushed into Telegraf (a telemetry consumer), and onwards into InfluxDB.

Kafka is deployed using kubectl, as seen in the deploy_infrastructure.sh script. The configurations for Kafka's deployment are in the YAML files in the jalapeno/infra/kafka/ directory.

ArangoDB

ArangoDB is Jalapeno's graph database.

Jalapeno Processors parse through data in Kafka, then create various collections in ArangoDB. These collections represent both the network's topology and its current state. For example, the Topology Processor parses BMP messages that have been streamed to Kafka and builds out collections such as "ls_node" and "l3vpn_prefix_v4" in Jalapeno's ArangoDB instance. These collections, in conjunction with ArangoDBs rapid graphical traversals and calculations, make it easy to determine the lowest-latency path, etc.

ArangoDB also houses the most interactive aspects of Jalapeno. Processors for Bandwidth and Latency upsert their scores here. Clients can run queries to generate label stacks for their desired network optimization attribute.

ArangoDB is deployed using kubectl, as seen in the deploy_infrastructure.sh script. The configurations for ArangoDB's deployment are in the YAML files in the jalapeno/infra/arangodb/ directory.

To access ArangoDB's UI, log in at <server_ip>:30852, using credentials root/jalapeno. In the list of DBs, select jalapeno.

InfluxDB

InfluxDB is Jalapeno's time-series database.

Telemetry data is parsed and passed from Kafka into InfluxDB using a telemetry consumer (Telegraf).

The data stored in InfluxDB can be queried by Processors and by the ArangoDB Jalapeno API. These queries construct and derive relevant metrics. For example, a processor could generate rolling-averages of bytes sent out of a router's interface, which would be used to simulate link utilization.

Using InfluxDB as a historical data-store, Jalapeno Processors can also infer trends based on historical analysis. Processors and even applications can determine whether instantaneous measurements are extreme anomalies, and can enable requests for any number of threshold-based reactions.

InfluxDB is deployed using kubectl, as seen in the deploy_infrastructure.sh script. The configurations for InfluxDB's deployment are in the YAML files in the jalapeno/infra/influxdb/ directory.

To access InfluxDB via Kubernetes, enter the pod's terminal and run:

influx
auth root jalapeno
use mdt_db
show series

Grafana

Grafana is Jalapeno's visual dashboard and metric visualization tool.

Loaded with InfluxDB as its data-source, Grafana has various graphical representations of the network, including historical bandwidth usage, historical latency metrics, and more.

Grafana is deployed using kubectl, as seen in the deploy_infrastructure.sh script. The configurations for Grafana's deployment are in the various YAML files in the jalapeno/infra/grafana/ directory.

To access Grafana's UI, log in at <server_ip>:30300, using credentials root/jalapeno.

A pair of example dashboard configurations can be found here:

Interface Egress Stats

and here:

Interface Ingress Stats

Telegraf-Egress

Telegraf is a telemetry consumer and forwarder.

Within the Jalapeno infrastructure, the Telegraf-Egress deployment of Telegraf consumes data from Kafka and forwards this data to InfluxDB.

Telegraf-Egress is deployed using kubectl, as seen in the deploy_infrastructure.sh script. The configurations for the Pipeline deployments are in the various YAML files in the jalapeno/infra/telegraf-egress/ directory.

Note

There exists another Telegraf instance (Telegraf-Ingress) that is part of Jalapeno's Collectors. The ingress instance consumes data from devices directly before forwarding the data to Kafka.