CS 333: Algorithms in the real world final project
- This is currently configured to run Docker on an Ubuntu 20.04 host (vcm.duke.edu).
- Follow the official instructions for installing the stable Docker Engine release.
- Follow the official instructions for installing stable Docker Compose on Linux.
- Run `cp template.env .env` and add the necessary tokens (see the hypothetical `.env` sketch after this list).
- Run `source .env`.
- Run `sudo docker-compose up --build`.
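For reference, a filled-in `.env` would look roughly like the sketch below. The variable names here are placeholders assumed from the Spotify/Twitter scraping described later; the real keys are whatever `template.env` lists.

```bash
# Hypothetical .env contents -- the actual variable names come from template.env.
# Exported so that `source .env` makes them visible to the shell running docker-compose.
export TWITTER_BEARER_TOKEN="<your Twitter API bearer token>"
export SPOTIFY_CLIENT_ID="<your Spotify client ID>"
export SPOTIFY_CLIENT_SECRET="<your Spotify client secret>"
```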
See `init.sh` and `create.sh` for current behavior and TODOs.
Steps 6-7 will need to be rerun after any code change until hot deploy is configured.
To view current Docker container information, run `docker ps` (may need to be run with `sudo`).
While the scripts are running and the containers are live, you can check the current state of the database: run `docker inspect twitifynd_db_1 | grep IPAddress` to get the local IP of your db container, then connect with `psql -h <IPAddress> -U postgres twitifynd`. The password can be found in the docker-compose file.
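If you would rather not grep by hand, the two commands above can be combined; the following is a small convenience sketch using `docker inspect`'s format flag to pull the IP directly (same container and database names as above).

```bash
# Look up the db container's IP and open a psql session against it.
# Assumes the container is named twitifynd_db_1, as in the command above.
DB_IP=$(sudo docker inspect -f '{{range .NetworkSettings.Networks}}{{.IPAddress}}{{end}}' twitifynd_db_1)
psql -h "$DB_IP" -U postgres twitifynd
```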
- (ON YOUR COMPUTER) Download the files from my email and unzip them.
- (ON YOUR COMPUTER) Run `scp -r ~/Downloads/log_2021-11-29_00:37:16 vcm@vcm_url:~/log_2021-11-29_00:37:16` (replace the first path with wherever you downloaded it to and `vcm_url` with your vcm hostname; you may find it helpful to make a Documents directory at the destination rather than just putting it in home (`~`)).
- (ON YOUR VM) Run `sudo cp -r ~/log_2021-11-29_00:37:16 /var/lib/docker/volumes/twitifynd_script_data/_data/log_2021-11-29_00:37:16` (requires you to have run the Docker steps above at least once so that directory exists).
- (ON YOUR VM) Open docker-compose.yml and make sure `DB_PROCESS` on line 29 is set to 2 (this forces it to load from the CSVs we just copied in).
- (ON YOUR VM) Run `sudo docker-compose up --build twitifynd` (the VM-side steps are also sketched as a script after this list).
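Taken together, the VM-side half of the restore can be scripted roughly as follows; the log directory name is the example one from this README, so substitute whatever you actually copied over.

```bash
#!/usr/bin/env bash
# Rough sketch of the VM-side restore steps above (not a script that ships with the repo).
set -euo pipefail

LOG_DIR="log_2021-11-29_00:37:16"
VOLUME_DATA="/var/lib/docker/volumes/twitifynd_script_data/_data"

# The volume directory only exists after docker-compose has been run at least once.
sudo cp -r ~/"$LOG_DIR" "$VOLUME_DATA/$LOG_DIR"

# DB_PROCESS must already be set to 2 in docker-compose.yml so the container
# loads from the copied CSVs instead of scraping fresh data.
sudo docker-compose up --build twitifynd
```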
This repository is set up using docker-compose.yml, which outlines how the different containers are configured. At the time of writing, there are two standard images (an unauthenticated mail server and a PostgreSQL server) and one custom container, which is built from the Dockerfile.
There are different options for running the latter, whose entrypoint is init.sh. Depending on the configuration, it makes sure the database is up, the tables have been created, and the necessary data is loaded from backups before proceeding to scrape data for Spotify and Twitter analysis.
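As a rough illustration of that flow (not the actual contents of `init.sh`), the startup logic looks something like the sketch below. `DB_HOST`, `load_from_csv.sh`, and `run_scrapers.sh` are hypothetical names; only `create.sh` and `DB_PROCESS` appear elsewhere in this README.

```bash
# Illustrative sketch of the init.sh startup flow -- see the real init.sh
# and create.sh in the repo for actual behavior and TODOs.

# 1. Wait for the postgres container to accept connections (DB_HOST is a placeholder).
until pg_isready -h "$DB_HOST" -U postgres; do
  sleep 1
done

# 2. Make sure the tables exist (create.sh is referenced earlier in this README).
./create.sh

# 3. If configured for CSV restore (DB_PROCESS=2 per the steps above), load the backups.
if [ "$DB_PROCESS" = "2" ]; then
  ./load_from_csv.sh   # hypothetical helper name
fi

# 4. Then proceed to the Twitter and Spotify scraping/analysis jobs.
./run_scrapers.sh      # hypothetical helper name
```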