lawun330/sannie-bot
Sannie Bot: BBC Burmese News on Telegram

This project is a Telegram web app bot called "Sannie." Sannie crawls the BBC Burmese website to display news content, allowing Telegram users to read the news without leaving the app. The entire user interface is displayed in Burmese, providing a native experience for Burmese-speaking users.

1. 🚀 User Manual

A Telegram account is required.

  • Direct Link: Chat with the bot directly
  • Search Method: Find the bot in Telegram's search bar: @presenter_sannie_bot

1.1. Bot Features

  • Inline Button - Interactive buttons within messages
  • Keyboard Button - Custom keyboard for easy navigation
  • Inline Mode - Search and share content directly from any chat

1.2. Available Commands

  1. /start - greet and return the main web app
  2. /help - describe how to use this bot
  3. /keyboard - return the keyboard button
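The three commands above amount to a small command-to-handler routing table. The sketch below shows that idea in plain Python; the handler names, reply strings, and web app URL placeholder are illustrative assumptions, not code taken from telegram-bot/app.py:

```python
# Hypothetical command-dispatch sketch. The real bot presumably uses a
# Telegram bot framework; only the routing idea is shown here.

def cmd_start() -> str:
    # Greet the user and point them at the web app (URL is a placeholder).
    return "မင်္ဂလာပါ! Open the web app: <web-app-url>"

def cmd_help() -> str:
    return "Use /start to open the web app, /keyboard for navigation buttons."

def cmd_keyboard() -> str:
    return "Keyboard buttons sent."

COMMANDS = {
    "/start": cmd_start,
    "/help": cmd_help,
    "/keyboard": cmd_keyboard,
}

def handle(message: str) -> str:
    """Route an incoming command to its handler, with a fallback reply."""
    handler = COMMANDS.get(message.strip().split()[0])
    return handler() if handler else "Unknown command. Try /help."
```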

1.3. Using the Web App

Once started, the bot automatically greets the user with a direct link to the web app. The user can then:

  • Browse by topic: Choose a topic, then a page, then click the "Read" button to view the content/article (Topic mode)
  • Enter a single link to read content/article directly (Insert Link mode)
  • Navigate between pages using Previous/Next buttons without returning to page selection

Note: All UI elements, buttons, and navigation are displayed in Burmese for better accessibility.


2. 📊 Current Version

The webscraper can

  • scrape all topics (all pages of each topic) from BBC Burmese,
  • scrape Burmese content/articles with a filter,
  • export scraped data to spreadsheets,
  • store scraped data in DynamoDB (permanent storage), and
  • cache frequently accessed data in Redis (fast cache).
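The Burmese-content filter mentioned above can be approximated by checking how many characters fall in the Myanmar Unicode block (U+1000–U+109F). This is an assumption about how such a filter might work, not the scraper's actual implementation in /webscraper/modules:

```python
# Hypothetical sketch of a Burmese-text filter: keep text only if a
# minimum share of its non-space characters are Myanmar-script code points.
# The threshold and logic are illustrative, not taken from the project.

MYANMAR_BLOCK = range(0x1000, 0x10A0)  # Unicode Myanmar block

def burmese_ratio(text: str) -> float:
    """Fraction of non-space characters that are Myanmar script."""
    letters = [c for c in text if not c.isspace()]
    if not letters:
        return 0.0
    burmese = sum(1 for c in letters if ord(c) in MYANMAR_BLOCK)
    return burmese / len(letters)

def is_burmese(text: str, threshold: float = 0.5) -> bool:
    """True when the text is predominantly Burmese."""
    return burmese_ratio(text) >= threshold
```

A filter like this would let the scraper discard English navigation text and boilerplate while keeping article bodies.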

3. 📁 Files and Directories

  • /.github/workflows - CI/CD pipeline (Ruff lint, ESLint, pytest on push/PR to main)
    • ci.yml - runs lint (Python + JS) and tests
  • /caching prototypes - Development and testing files for caching system
  • /db - DynamoDB Local database files and scripts
  • /docs - Frontend files for GitHub Pages hosting
  • /img - Project images and assets
  • /notebooks - Jupyter notebooks for webscraper development and documentation
  • /spreadsheets - Exported data (ignored in version control)
  • /telegram-bot - Telegram bot scripts (requires .env file for tokens)
    • app.py - Main bot application
    • credentials.py - Environment variable handler
    • Dockerfile - Bot container configuration
    • Procfile - Bot deployment configuration (Railway / Render)
    • requirements.txt - Bot-specific dependencies
  • /tests - Pytest tests
    • conftest.py - shared fixtures (mocked Redis, DynamoDB, scraper; FastAPI client)
    • test_api.py - FastAPI endpoint tests
    • test_dynamo_helpers.py - db.dynamo_helpers tests
    • test_telegram_bot.py - bot handler tests
  • /webscraper - Main Python web scraping scripts and modules
    • /modules - Modular scraping scripts
  • api.py - FastAPI server for web scraping endpoints
  • docker-compose.yml - Orchestrates all Docker services (FastAPI, Redis, Telegram Bot)
  • Dockerfile - FastAPI app container configuration
  • flow.md - Control flow documentation
  • DEVELOPMENT_GUIDE.md - Local development setup guide
  • DEPLOYMENT_GUIDE.md - Production deployment guide
  • Procfile - FastAPI app deployment configuration (Railway / Render)
  • pyproject.toml - Ruff linter configuration
  • pytest.ini - Pytest configuration
  • requirements.txt - Runtime dependencies
  • requirements-dev.txt - Development and test dependencies
  • .eslintrc.json - ESLint configuration for JavaScript
  • .eslintignore - Paths ignored by ESLint

4. 📈 Project Development

  1. Web Scraper Development - See the notebooks in the /notebooks folder for detailed documentation on how the custom web crawler was built from scratch
  2. Integration - The web scraper is combined with multiple components:
    • Frontend (web-hosted with GitHub Pages) with improved navigation and direct view buttons
    • Telegram bot (created to use the hosted frontend)
    • Redis cache (fast in-memory storage for frequently accessed data)
    • DynamoDB (permanent storage for all scraped data)
    • Three-tier caching strategy: Redis -> DynamoDB -> Web scraping
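The Redis -> DynamoDB -> web scraping fallback above can be sketched as a read-through chain that back-fills the faster tiers on a miss. The function names below are placeholders standing in for the project's real Redis/DynamoDB/scraper helpers:

```python
# Read-through sketch of the three-tier strategy: Redis (fast cache) ->
# DynamoDB (permanent store) -> live scrape. Callables stand in for the
# real clients; all names here are illustrative, not project code.

from typing import Callable, Optional

def get_article(
    key: str,
    redis_get: Callable[[str], Optional[str]],
    dynamo_get: Callable[[str], Optional[str]],
    scrape: Callable[[str], str],
    redis_set: Callable[[str, str], None],
    dynamo_put: Callable[[str, str], None],
) -> str:
    """Return an article, checking each tier and back-filling on a miss."""
    cached = redis_get(key)
    if cached is not None:
        return cached              # tier 1 hit: Redis
    stored = dynamo_get(key)
    if stored is not None:
        redis_set(key, stored)     # warm the cache for next time
        return stored              # tier 2 hit: DynamoDB
    fresh = scrape(key)            # tier 3: scrape the live page
    dynamo_put(key, fresh)         # persist permanently
    redis_set(key, fresh)          # and cache for fast reads
    return fresh
```

The design keeps scraping as a last resort: repeat reads of a popular article hit Redis, cold reads hit DynamoDB, and only genuinely new content triggers a crawl.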

5. 📚 Documentation


License

This project is intended for educational purposes.
