35
README.md
|
@ -9,8 +9,34 @@ AIL framework - Framework for Analysis of Information Leaks
|
|||
|
||||
AIL is a modular framework to analyse potential information leaks from unstructured data sources like pastes from Pastebin or similar services. AIL framework is flexible and can be extended to support other functionalities to mine sensitive information.
|
||||
|
||||

|
||||

|
||||

|
||||
|
||||
Trending charts
|
||||
---------------
|
||||
|
||||

|
||||

|
||||
|
||||
Browsing
|
||||
--------
|
||||
|
||||

|
||||
|
||||
Sentiment analysis
|
||||
------------------
|
||||
|
||||

|
||||
|
||||
Terms manager and occurence
|
||||
---------------------------
|
||||
|
||||

|
||||
|
||||
## Top terms
|
||||
|
||||

|
||||

|
||||
|
||||
|
||||
AIL framework screencast: https://www.youtube.com/watch?v=9idfHCIMzBY
|
||||
|
||||
|
@ -26,6 +52,9 @@ Features
|
|||
* Module for extracting Tor .onion addresses (to be further processed for analysis)
|
||||
* Extracting and validating potential hostnames (e.g. to feed Passive DNS systems)
|
||||
* A full-text indexer module to index unstructured information
|
||||
* Modules and web statistics
|
||||
* Global sentiment analysis for each providers based on nltk vader module
|
||||
* Terms tracking and occurence
|
||||
* Many more modules for extracting phone numbers, credentials and others
|
||||
|
||||
Installation
|
||||
|
@ -48,6 +77,7 @@ linux based distributions, you can replace it with [installing_deps_archlinux.sh
|
|||
|
||||
There is also a [Travis file](.travis.yml) used for automating the installation that can be used to build and install AIL on other systems.
|
||||
|
||||
|
||||
Starting AIL web interface
|
||||
--------------------------
|
||||
|
||||
|
@ -94,6 +124,7 @@ Redis and LevelDB overview
|
|||
* DB 0 - Cache hostname/dns
|
||||
* Redis on TCP port 6380 - Redis Pub-Sub only
|
||||
* Redis on TCP port 6381 - DB 0 - Queue and Paste content LRU cache
|
||||
* Redis on TCP port 6382 - DB 1-4 - Trending, terms and sentiments
|
||||
* LevelDB on TCP port <year> - Lines duplicate
|
||||
|
||||
LICENSE
|
||||
|
|
After Width: | Height: | Size: 126 KiB |
After Width: | Height: | Size: 190 KiB |
After Width: | Height: | Size: 56 KiB |
After Width: | Height: | Size: 63 KiB |
After Width: | Height: | Size: 31 KiB |
After Width: | Height: | Size: 86 KiB |
After Width: | Height: | Size: 54 KiB |
After Width: | Height: | Size: 57 KiB |
|
@ -83,5 +83,6 @@ pushd tlsh/py_ext
|
|||
python setup.py build
|
||||
python setup.py install
|
||||
|
||||
# Download the necessary NLTK corpora
|
||||
# Download the necessary NLTK corpora and sentiment vader
|
||||
HOME=$(pwd) python -m textblob.download_corpora
|
||||
python -m nltk.downloader vader_lexicon
|
||||
|
|