Monsef Rachid
Home About Projects Contact

GlobalFootballDaily.com

Football Data Web Scraper & News Aggregation Website

Project Overview

This project showcases a fully automated web scraper and news aggregation platform tailored for football enthusiasts. It features a robust content management system (CMS) for managing news categories, topics, data sources, and scraper rules. The backend automates data collection using topic-based scraping, ensuring relevant and up-to-date content. The frontend is designed as a user-friendly news website, neatly organized by categories and topics, providing seamless navigation and a structured presentation of curated football news. This project highlights expertise in web scraping, automation, CMS development, and frontend design.

Tools Used

Laravel PHP JavaScript Alpine.js MySQL HTML Tailwind CSS GitHub Actions

Project Details

  • Built the application using Laravel with Blade templates for server-side rendering and Alpine.js for interactive user experiences.
  • Designed and implemented a responsive frontend using Tailwind CSS, ensuring a modern and user-friendly interface.
  • Developed a backend content management system (CMS) for managing news topics, categories, data sources, and scraping rules.
  • Engineered a system to customize web scrapers tailored to each website's structure, enabling administrators to easily add new data sources by defining HTML classes and attributes for scraping.
  • Implemented a CI/CD pipeline using GitHub Actions to automate testing, deployment, and monitoring of code changes.
  • Deployed the application on an AWS EC2 instance running a LAMP stack, configured with robust security measures and HTTPS encryption.
  • Optimized scraping logic to handle dynamic content and prevent issues with rate limits or bot detection.
  • Set up logging and monitoring to track scraper performance, error handling, and data collection accuracy.
  • Ensured scalability and maintainability by adhering to clean code principles, modular architecture, and reusable components.
  • Implemented automated testing, including unit tests for backend logic, integration tests for API functionality, and end-to-end tests for the scraping system and user interface.