28 Forks
40 Stars
40 Watchers

RealtimeStreamingEngineering

This project serves as a comprehensive guide to building an end-to-end data engineering pipeline using TCP/IP Socket, Apache Spark, OpenAI LLM, Kafka and Elasticsearch. It covers each stage from data acquisition, processing, sentiment analysis with ChatGPT, production to kafka topic and connection to elasticsearch.

How to download and setup RealtimeStreamingEngineering

Open terminal and run command
git clone https://github.com/airscholar/RealtimeStreamingEngineering.git
git clone is used to create a copy or clone of RealtimeStreamingEngineering repositories. You pass git clone a repository URL.
it supports a few different network protocols and corresponding URL formats.

Also you may download zip file with RealtimeStreamingEngineering https://github.com/airscholar/RealtimeStreamingEngineering/archive/master.zip

Or simply clone RealtimeStreamingEngineering with SSH
[email protected]:airscholar/RealtimeStreamingEngineering.git

If you have some problems with RealtimeStreamingEngineering

You may open issue on RealtimeStreamingEngineering support forum (system) here: https://github.com/airscholar/RealtimeStreamingEngineering/issues

Similar to RealtimeStreamingEngineering repositories

Here you may see RealtimeStreamingEngineering alternatives and analogs

 grafana    elasticsearch    FOSElasticaBundle    crawler    bookbrainz-site    elastic4s    elk-docker    dev-setup    zentral    Opserver    elasticsearch-HQ    pipeline    sentinl    awesome-aws    yii2-elasticsearch    great-big-example-application    gardening    dejavu    mirage    kibana    NewsBlur    analysis-ik    docker-elk    elasticsearch-sql    Linux-Tutorial    searchkit    elasticsearch-dump    peek    elastic    vue-storefront