# BGP-Ranking New version of BGP Ranking, complete rewrite in python3.6+ and an ARDB backend # Directory structure *Config files*: `listimport / modules_config / *.json` *Per-module parsers*: `listimport / parsers` *Libraries* : `listimport / libs` # Raw dataset directory structure ## Files to import ` / / ` ## Last modified date (if possible) and lock file ` / / / meta` ## Imported files less than 2 months old ` / / / archive` ## Imported files more than 2 months old ` / / / archive / deep` # Databases ## Intake (redis, port 6579) *Usage*: All the modules push their entries in this database. Creates the following hashes: ```python UUID = {'ip': , 'source': , 'datetime': } ``` Creates a set `intake` for further processing containing all the UUIDs. ## Pre-Insert (redis, port 6580) *Usage*: Make sure th IPs are global, validate input from the intake module. Pop UUIDs from `intake`, get the hashes with that key Creates the following hashes: ```python UUID = {'ip': , 'source': , 'datetime': , 'date': } ``` Creates a set `to_insert` for further processing containing all the UUIDs. Creates a set `for_ris_lookup` to lookup on the RIS database. Contains all the IPs. ## Routing Information Service cache (redis, port 6581) *Usage*: Lookup IPs against the RIPE's RIS database Pop IPs from `for_ris_lookup`. Creates the following hashes: ```python IP = {'asn': , 'prefix': , 'description': } ``` ## Ranking Information cache (redis, port 6582) *Usage*: Store the current list of known ASNs at RIPE, and the prefixes originating from them. Creates the following sets: ```python asns = set([, ...]) |v4 = set([, ...]) |v6 = set([, ...]) ``` And the following keys: ```python |v4|ipcount = |v6|ipcount = ``` ## Long term storage (ardb, port 16579) *Usage*: Stores the IPs with the required meta informations required for ranking. Pop UUIDs from `to_insert`, get the hashes with that key Use the IP from that hash to get the RIS informations. Creates the following sets: ```python # All the sources, by day |sources = set([, ...]) # All the ASNs by source, by day | -> set([, ...]) # All the prefixes, by ASN, by source, by day || -> set([, ...]) # All the tuples (ip, datetime), by prefixes, by ASN, by source, by day ||| -> set([|, ...]) ```