Telegram Crawler

NOTE: None of Exa’s projects are open source. So, I am not allowed to share the code which I have developed in this project.

Introduction

This project is developed for crawling data from public channels and groups. Collected data is later being analyzed to be accessible by our users.

Data Crawling

All posts and messages from telegram public channels and groups are collected on daily basis. For groups, participants are also extracted. With the help of NLP modules, data is analyzed and visualized, then users can have access to the analysis through a UI platform. They can also search the keywords and get the related news and data.

Technologies/Languages Used

Technology Usage
Python is the language used for this project
RabbitMQ is used to publish tasks for the service
PostgreSQL is used for credentials and authentication parameters
Kafka is used to produce collected data
MongoDB is used to store produced data
Redis is used to store telegram sessions
Flask is used for authentication service
Docker is used for virtualization and containerizing services
Git is used for version control

Project information

  • Category: Software
  • Project date: August 2021

Project Description