2016-01-19 14:17:16 +01:00
[![Build Status ](https://travis-ci.org/CIRCL/AIL-framework.svg?branch=master )](https://travis-ci.org/CIRCL/AIL-framework)
2014-08-06 11:43:40 +02:00
AIL
===
2016-02-08 11:34:54 +01:00
![Logo ](./doc/logo/logo-small.png?raw=true "AIL logo" )
2014-12-01 10:43:36 +01:00
AIL framework - Framework for Analysis of Information Leaks
2014-08-06 11:43:40 +02:00
2014-09-18 13:28:25 +02:00
AIL is a modular framework to analyse potential information leaks from unstructured data sources like pastes from Pastebin or similar services. AIL framework is flexible and can be extended to support other functionalities to mine sensitive information.
2014-08-06 11:43:40 +02:00
2016-08-23 17:05:45 +02:00
![Dashboard ](./doc/screenshots/dashboard.png?raw=true "AIL framework dashboard" )
2016-08-23 17:20:22 +02:00
2016-08-23 17:31:30 +02:00
Trending charts
---------------
2016-08-23 17:27:04 +02:00
2016-08-23 17:05:45 +02:00
![Trending-Web ](./doc/screenshots/trending-web.png?raw=true "AIL framework webtrending" )
![Trending-Modules ](./doc/screenshots/trending-module.png?raw=true "AIL framework modulestrending" )
2016-08-23 17:20:22 +02:00
2016-08-23 17:27:04 +02:00
Browsing
--------
2016-08-23 17:05:45 +02:00
![Browse-Pastes ](./doc/screenshots/browse-important.png?raw=true "AIL framework browseImportantPastes" )
2016-08-23 17:20:22 +02:00
2016-08-23 17:27:04 +02:00
Sentiment analysis
------------------
2016-08-23 17:05:45 +02:00
![Sentiment ](./doc/screenshots/sentiment.png?raw=true "AIL framework sentimentanalysis" )
2016-08-23 17:20:22 +02:00
2016-08-23 17:27:04 +02:00
Terms manager and occurence
---------------------------
2016-08-23 17:09:28 +02:00
![Term-Manager ](./doc/screenshots/terms-manager.png?raw=true "AIL framework termManager" )
2016-08-23 17:31:30 +02:00
## Top terms
2016-08-23 17:09:28 +02:00
![Term-Top ](./doc/screenshots/terms-top.png?raw=true "AIL framework termTop" )
![Term-Plot ](./doc/screenshots/terms-plot.png?raw=true "AIL framework termPlot" )
2016-08-23 17:05:45 +02:00
2014-08-06 11:43:40 +02:00
2014-09-22 13:36:13 +02:00
AIL framework screencast: https://www.youtube.com/watch?v=9idfHCIMzBY
2014-09-22 13:35:46 +02:00
2016-02-08 11:49:33 +01:00
Features
--------
2016-02-08 14:13:24 +01:00
* Modular architecture to handle streams of unstructured or structured information
* Default support for external ZMQ feeds, such as provided by CIRCL or other providers
* Each module can process and reprocess the information already processed by AIL
* Detecting and extracting URLs including their geographical location (e.g. IP address location)
2016-02-08 11:49:33 +01:00
* Extracting and validating potential leak of credit cards numbers
* Extracting and validating email addresses leaked including DNS MX validation
* Module for extracting Tor .onion addresses (to be further processed for analysis)
* Extracting and validating potential hostnames (e.g. to feed Passive DNS systems)
* A full-text indexer module to index unstructured information
2016-08-23 17:20:22 +02:00
* Modules and web statistics
* Global sentiment analysis for each providers based on nltk vader module
* Terms tracking and occurence
2016-02-08 14:13:24 +01:00
* Many more modules for extracting phone numbers, credentials and others
2016-02-08 11:49:33 +01:00
2016-02-08 11:16:53 +01:00
Installation
------------
2014-08-06 11:43:40 +02:00
2014-09-18 13:28:25 +02:00
Type these command lines for a fully automated installation and start AIL framework
2014-08-25 15:02:53 +02:00
```
git clone https://github.com/CIRCL/AIL-framework.git
cd AIL-framework
./installing_deps.sh
cd var/www/
./update_thirdparty.sh
cd ~/AIL-framework/
. ./AILENV/bin/activate
cd bin/
./LAUNCH.sh
```
2016-02-08 14:13:24 +01:00
The default [installing_deps.sh ](./installing_deps.sh ) is for Debian and Ubuntu based distributions. For Arch
linux based distributions, you can replace it with [installing_deps_archlinux.sh ](./installing_deps_archlinux.sh ).
2014-08-25 15:02:53 +02:00
2016-02-08 14:13:24 +01:00
There is also a [Travis file ](.travis.yml ) used for automating the installation that can be used to build and install AIL on other systems.
2014-08-06 11:43:40 +02:00
2016-08-23 17:20:22 +02:00
2016-02-08 11:16:53 +01:00
Starting AIL web interface
--------------------------
2014-08-06 11:43:40 +02:00
2016-02-08 14:13:24 +01:00
To start the web interface, you first need to fetch the required Javascript/CSS files:
2014-08-08 11:42:51 +02:00
```
cd $AILENV
cd var/www/
bash update_thirdparty.sh
```
2016-02-08 14:13:24 +01:00
and then you can start the web interface python script:
2014-08-08 11:42:51 +02:00
```
cd $AILENV
cd var/www/
Flask_server.py
```
2016-02-08 14:13:24 +01:00
Eventually you can browse the status of the AIL framework website at the following URL:
2014-08-06 11:43:40 +02:00
``http://localhost:7000/``
2016-02-08 10:39:32 +01:00
How to create a new module
--------------------------
2014-08-06 11:43:40 +02:00
2016-02-08 14:13:24 +01:00
If you want to add a new processing or analysis module in AIL, follow these simple steps:
2014-08-06 11:43:40 +02:00
2016-02-08 10:43:58 +01:00
1. Add your module name in [./bin/packages/modules.cfg ](./bin/packages/modules.cfg ) and subscribe to the Redis_Global at minimum.
2014-08-06 11:43:40 +02:00
2016-02-08 10:43:58 +01:00
2. Use [./bin/template.py ](./bin/template.py ) as a sample module and create a new file in bin/ with the module name used in the modules.cfg configuration.
2014-08-06 11:43:40 +02:00
2016-02-08 11:55:39 +01:00
How to contribute a module
--------------------------
Feel free to fork the code, play with it, make some patches or add additional analysis modules.
To contribute your module, feel free to pull your contribution.
2014-08-06 11:43:40 +02:00
2014-08-20 15:31:10 +02:00
Redis and LevelDB overview
--------------------------
2014-08-20 15:33:26 +02:00
* Redis on TCP port 6379 - DB 1 - Paste meta-data
* DB 0 - Cache hostname/dns
* Redis on TCP port 6380 - Redis Pub-Sub only
* Redis on TCP port 6381 - DB 0 - Queue and Paste content LRU cache
2016-08-23 17:20:22 +02:00
* Redis on TCP port 6382 - DB 1-4 - Trending, terms and sentiments
2014-08-20 15:33:26 +02:00
* LevelDB on TCP port < year > - Lines duplicate
2014-08-20 15:31:10 +02:00
2014-08-06 11:43:40 +02:00
LICENSE
-------
```
Copyright (C) 2014 Jules Debra
2016-02-08 10:39:32 +01:00
Copyright (C) 2014-2016 CIRCL - Computer Incident Response Center Luxembourg (c/o smile, security made in Lëtzebuerg, Groupement d'Intérêt Economique)
Copyright (c) 2014-2016 Raphaël Vinot
Copyright (c) 2014-2016 Alexandre Dulaunoy
2016-08-19 13:34:02 +02:00
Copyright (c) 2016 Sami Mokaddem
2014-08-06 11:43:40 +02:00
This program is free software: you can redistribute it and/or modify
it under the terms of the GNU Affero General Public License as published by
the Free Software Foundation, either version 3 of the License, or
(at your option) any later version.
This program is distributed in the hope that it will be useful,
but WITHOUT ANY WARRANTY; without even the implied warranty of
MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the
GNU Affero General Public License for more details.
You should have received a copy of the GNU Affero General Public License
along with this program. If not, see < http: / / www . gnu . org / licenses / > .
```