Skip to content

A real-time dashboard for analyzing Apache access logs using Kafka and Spark Streaming.

Notifications You must be signed in to change notification settings

mhagglun/Realtime-Dashboard

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Real-time Dashboard

Architecture

The real-time dashboard is implemented using:

  • Apache Kafka
  • Apache Spark Streaming
  • Producer and Consumer written in Scala and Python
  • Backend: Node.js, Cassandra, socket.io
  • Frontend: HTML, Highchart.js

Usage

Start Zookeeper

zookeeper-server-start.sh $KAFKA_HOME/config/zookeeper.properties

Start kafka

kafka-server-start.sh $KAFKA_HOME/config/server.properties

Start cassandra

cassandra -f

Create topic

kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic access-log

Start the producer and pass the topic name as an argument

python producer.py --topic access-log

Start the consumer to read and process the messages Navigate to src/consumer and run

sbt run

Run the backend server via

node index.js

And view the dashboard here

Screenshot

About

A real-time dashboard for analyzing Apache access logs using Kafka and Spark Streaming.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published