Skip to content

Latest commit

 

History

History
45 lines (26 loc) · 1.16 KB

README.md

File metadata and controls

45 lines (26 loc) · 1.16 KB

Famous Twin

This is a Web application where users can enter the day, month, and year to see a list of famous people born on that exact same date. The data is collected from the Births sections on Wikipedia's day pages, like this.

But this is really just a way for me to learn and play around with some cool technologies.

Technology Stack

  • Python script scrapes Wikipedia using mwclient.
  • Birthday data stored in MongoDB. (Although right now it's just a JSON file and resides in memory when app server is running.)
  • Node.js service for serving birthday requests via JSONP calls.
  • jQuery and Knockout JavaScript libraries in the front end.

Instructions

Prerequisites

  • Python 2.7
  • Node 0.6.19+

Optional

  • python-dev package for simplejson speedups

Python Wikipedia Scraper

Set Up

From root of repo--

pip install -r requirements.txt

To run scraper to generate ./birthdays.json--

cd fetch_data
python wikiparse.py

To run tests--

cd fetch_data
python test_wikiparse.py

Server

TBD