smag-mvp

module

v0.0.0-...-ae3587a Latest Latest Go to latest Published: Mar 24, 2020 License: LGPL-3.0

Details

Valid go.mod file
Redistributable license
Tagged version
Stable version
Learn more about best practices

Repository

github.com/codeuniversity/smag-mvp

README ¶

Distributed scraping and analysis pipeline for a range of social media platforms

Shields

Table of content

About
Architectural overview
Further reading
Getting started

About

The goal of this project is to raise awareness about data privacy. The mean to do so is a tool to scrape, combine and analyze public data from multiple social media sources.
The results will be available via an API, used for some kind of art exhibition.

Architectural overview

You can find an more detailed overview here.
Open it in draw.io and have a look at the different tabs "High level overview", "Distributed Scraper" and "Face Search".

Github handle	Real name	Instagram profile	Twitter profile
@1Jo1	Josef Grieb	josef_grieb	josefgrieb
@Urhengulas	Johann Hemmann	Urhengulas	Johann
@alexmorten	Alexander Martin	no profile :(	no profile :(
@jo-fr	Jonathan Freiberger	jonifreiberger	Jonathan
@m-lukas	Lukas Müller	lmglukas	Lukas Müller
@lukas-menzel	Lukas Menzel	lukasmenzel	Lukas Menzel
@SpringHawk	Martin Zaubitzer	/	/

Deployment

The deployment of this project to kubernetes happens in codeuniversity/smag-deploy (this is a private repo!)

Getting started

Requirements

depency	version
`go`	`v1.13` (go modules)
`docker`	`v19.x`
`docker-compose`	`v1.24.x`

Preparation

If this is your first time running this:

Add 127.0.0.1 my-kafka and 127.0.0.1 minio to your /etc/hosts file
Choose a <user_name> for your platform of choice <instagram|twitter> as a starting point and run
```
$ go run cli/main/main.go <instagram|twitter> <user_name>
```

Scraper

Run the instagram- or twitter-scraper in docker:

$ make run-<platform_name>

Directories ¶

Path	Synopsis
api
grpcserver
grpcserver/main
proto
aws_service
main
proto
cli
main
config
db
elastic
indexer
models
search/faces
search/facetest
face-recognition
main
faces
proto
recognitiontest
http_header-generator
imgproxy
insta
filter/post_face-recon
filter/post_pictures
filter/user_names
indexer/comments
indexer/faces
indexer/posts
indexer/users
inserter/comments
inserter/comments/main
inserter/likes
inserter/likes/main
inserter/neo4j/posts
inserter/neo4j/tagged_users
inserter/neo4j/user
inserter/postgres
inserter/postgres/main
inserter/posts
inserter/posts/main
inserter/posts_face
inserter/posts_face/main
models
pics-downloader
pics-downloader/main
posts_face-detection
posts_face-detection/main
scraper/comments
scraper/comments/main
scraper/likes
scraper/likes/main
scraper/posts
scraper/posts/main
scraper/user
scraper/user/main
kafka
changestream
neo4j
create-import-user-json
create-import-user-json/main
inserter
nlp
frequency-analyzer
frequency-analyzer/main
scraper-client
service
twitter
filter/user_names
inserter/posts
inserter/posts/main
inserter/users
inserter/users/main
models
utils
worker

part	docs	contact
Api	`api/README.md`	@jo-fr
Frontend	`frontend/README.md`	@lukas-menzel
Postgres DB	`db/README.md`	@alexmorten

?	: This menu
/	: Search site
f or F	: Jump to
y or Y	: Canonical URL

smag-mvp

README ¶

About

Architectural overview

Further reading

Detailed documentation

Wanna contribute?

List of contributors

Deployment

Getting started

Requirements

Preparation

Scraper

Directories ¶

README ¶

Social Record

About

Architectural overview

Further reading

Detailed documentation

Wanna contribute?

List of contributors

Deployment

Getting started

Requirements

Preparation

Scraper

Directories ¶