Lookyloo is a web interface that scrapes a website and then displays a tree of the domains calling each other. https://lookyloo.circl.lu/
Michael Schaefer 51ed616b48 Merge remote-tracking branch 'upstream/master' 2018-04-08 17:45:36 +02:00

Lookyloo icon

Lookyloo is a web interface that scrapes a website and then displays a tree of the domains calling each other.
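To make the idea of a "tree of domains calling each other" concrete, here is a minimal sketch in plain Python. It is not how Lookyloo or har2tree build their tree; it only assumes that each request is available as a (referrer URL, requested URL) pair. The function names and the example hostnames are hypothetical.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def build_domain_tree(requests):
    """Map each hostname to the sorted list of other hostnames it requested."""
    tree = defaultdict(set)
    for referrer, requested in requests:
        parent = urlsplit(referrer).hostname
        child = urlsplit(requested).hostname
        # Skip malformed URLs and same-host requests: they add no edge between domains.
        if parent and child and parent != child:
            tree[parent].add(child)
    return {parent: sorted(children) for parent, children in tree.items()}


def render(tree, root, depth=0, seen=None):
    """Return one indented line per hostname, children below their parent."""
    seen = set() if seen is None else seen
    lines = ["  " * depth + root]
    # Expand each hostname only once; this also protects against cycles.
    if root in seen:
        return lines
    seen.add(root)
    for child in tree.get(root, []):
        lines.extend(render(tree, child, depth + 1, seen))
    return lines


# Hypothetical capture: the page loads a script, which in turn loads a tracker.
example_requests = [
    ("https://example.com/", "https://cdn.example.net/app.js"),
    ("https://example.com/", "https://example.com/style.css"),
    ("https://cdn.example.net/app.js", "https://tracker.example.org/pixel.gif"),
]
example_tree = build_domain_tree(example_requests)
example_lines = render(example_tree, "example.com")
```

Printing `"\n".join(example_lines)` shows `example.com` at the top, `cdn.example.net` indented below it, and `tracker.example.org` one level deeper.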

What is that name?!

1. People who just come to look.
2. People who go out of their way to look at people or something often causing crowds and more disruption.
3. People who enjoy staring at other people's misfortune. Oftentimes onlookers at car accidents.
Same as Looky Lou; often spelled as Looky-loo (hyphen) or lookylou.
In L.A. usually the lookyloos cause more accidents by not paying full attention to what is ahead of them.

Source: Urban Dictionary

Screenshot

Screenshot of Lookyloo

Implementation details

This code is very heavily inspired by webplugin and adapted to use Flask as the backend.

Installation of har2tree

The core dependency is ETE Toolkit, which you can install by following the guide on the official website.

Note: all the PyQt4 dependencies are optional.

Installation of scrapysplashwrapper

You need a running Splash instance, preferably in Docker:

sudo apt install docker.io
sudo docker pull scrapinghub/splash
sudo docker run -p 8050:8050 -p 5023:5023 scrapinghub/splash --disable-ui --disable-lua
# On a server with a decent amount of RAM, you may want to run it this way:
# sudo docker run -p 8050:8050 -p 5023:5023 scrapinghub/splash --disable-ui -s 100 --disable-lua -m 50000
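Once the container is up, a quick way to confirm that Splash answers is to ask it for a HAR capture over its HTTP API. The sketch below assumes Splash is reachable on `127.0.0.1:8050` (the port published by the `docker run` command above) and uses Splash's `render.har` endpoint with its `url` and `wait` arguments. Whether scrapysplashwrapper talks to Splash through this same endpoint is not stated here, so treat this purely as a manual check; the function names are hypothetical.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Assumption: Splash listens locally on the port published by docker run.
SPLASH_BASE = "http://127.0.0.1:8050"


def render_har_url(target, wait=1.0, base=SPLASH_BASE):
    """Build the Splash render.har URL for capturing `target`."""
    query = urlencode({"url": target, "wait": wait})
    return f"{base}/render.har?{query}"


def fetch_har(target, wait=1.0, timeout=60.0):
    """Ask Splash to load `target` and return the HAR as a dict (needs a running Splash)."""
    with urllib.request.urlopen(render_har_url(target, wait), timeout=timeout) as response:
        return json.load(response)
```

Calling `fetch_har("https://example.com/")` should return a dictionary whose top-level key is `log`, as in any HAR file. Only `render_har_url` can be exercised without a running instance.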

Installation of the whole thing

pip install -r requirements.txt
pip install -e .
wget https://d3js.org/d3.v5.min.js -O lookyloo/static/d3.v5.min.js
wget https://cdn.rawgit.com/eligrey/FileSaver.js/5733e40e5af936eb3f48554cf6a8a7075d71d18a/FileSaver.js -O lookyloo/static/FileSaver.js
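If one of the `wget` commands fails silently, the interface will load without its JavaScript. A small check such as the following, run from the repository root, can confirm that both files landed where the commands above put them. The helper name is hypothetical; the file names and directory come from the commands above.

```python
from pathlib import Path

# File names taken from the two wget commands in the installation steps.
EXPECTED_STATIC_FILES = ("d3.v5.min.js", "FileSaver.js")


def missing_static_files(static_dir="lookyloo/static"):
    """Return the expected static files that are absent from `static_dir`."""
    static_path = Path(static_dir)
    return [name for name in EXPECTED_STATIC_FILES if not (static_path / name).is_file()]
```

An empty list means both downloads are in place; otherwise the list names the files to fetch again.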

Run the app locally

export FLASK_APP=lookyloo
flask run

With a reverse proxy (Nginx)

pip install uwsgi

Config files

You have to configure the following two files:

  • etc/nginx/sites-available/lookyloo
  • etc/systemd/system/lookyloo.service

Copy them to the appropriate directories, then run the following command:

sudo ln -s /etc/nginx/sites-available/lookyloo /etc/nginx/sites-enabled

If needed, remove the default site:

sudo rm /etc/nginx/sites-enabled/default

Make sure everything is working:

sudo systemctl start lookyloo
sudo systemctl enable lookyloo
sudo nginx -t
# If the configuration test passes:
sudo service nginx restart

You can then open http://<IP-or-domain>/
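Instead of opening a browser, the deployment can also be smoke-tested from a script. This is a minimal sketch using only the standard library; the function name is hypothetical, and the URL you pass is whatever IP or domain your Nginx site answers on.

```python
import urllib.error
import urllib.request


def site_responds(url, timeout=5.0):
    """Return True if `url` answers with an HTTP status below 400."""
    try:
        with urllib.request.urlopen(url, timeout=timeout) as response:
            return response.status < 400
    except (urllib.error.URLError, OSError):
        # Covers refused connections, timeouts, DNS failures and 4xx/5xx replies.
        return False
```

For example, `site_responds("http://<IP-or-domain>/")` with your own host filled in returns `False` when Nginx is down or when uWSGI is not answering behind it (Nginx then typically replies with a 502).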

Next, you should configure TLS (for example with Let's Encrypt).