misp-website/_posts/2019-09-12-Decaying-Of-Indi...

---
title: Decaying of Indicators - MISP improved model to expire indicators based on custom models
layout: post
featured: /assets/images/misp/blog/decay.png
---

# An improved and flexible model to expire indicators

This blog post introduces the new functionality of MISP 2.4.116 which allows users and organisations to easily expire information depending on their objectives and targets.

MISP being a distributed system, various users and organisations are sharing data, sometimes without even knowing each others. While having access to a lot of information is extremely beneficial for all parties, it, however, also induces challenges to deal with.

In this blog post, we will mainly discuss about information **quality** and **freshness**, other issues like **trust**, **use-cases**, **interests**, etc. are partially taken into account but will not be presented. Nevertheless, these concepts are examined in this [paper](https://arxiv.org/abs/1902.03914) along with a detailed explanation of the solution we have chosen to tackle these issues.

Our main objective is to provide users a **simple yet customizable system** to automatically (or manually) mark an *Indicator Of Compromise* (or more generic, an *Attribute*) as **expired**.

Before getting started to show how the model presented in the paper is implemented in MISP, we first need to have a look at some concepts needed to better understand how components are working and tied together.

# The (potentially) annoying bits of theory

The solution currently supported in MISP is based on two components: ``base_score`` and ``score``. The idea is to have an initial fixed value called ``base_score`` taking into account the **quality** of an indicator; and a time-dependant ``score``, which decreases the more time passes.

A simplified version would be something like this:
```
score = base_score * P
```
Where ``P`` is composed of ``parameters``:
- ``lifetime``: The lifetime of the IOC or the time at which the score of the *Attribute*'s score will be 0
- ``decay_speed``: The speed at which the decay happens or the speed at which an *Attribute* will loose score

**⚠** **It should be noted that everytime a [*Sightings*](https://www.circl.lu/doc/misp/sightings/) is added to an *Attribute*, the ``score`` is refresh to the ``base_score`` and a new decay is initiated from that point.**

# Polynomial Decaying Model built-in in MISP

We still have to see how the ``base_score`` is actually computed. In the built-in version of the *Decaying Model* in MISP, the ``base_score`` is computed from the *Taxonomies* and some weight. Weights are a mean to prioritize extracted ``numerical_values`` from *Taxonomies* over others.

To give the intuition of how the ``base_score`` computation works, let's look at two examples. In these examples, the two *Taxonomies* used are
[*phishing*](https://github.com/MISP/misp-taxonomies/blob/master/phishing/machinetag.json) and [*admiralty-scale*](https://github.com/MISP/misp-taxonomies/blob/master/admiralty-scale/machinetag.json). Both of them contain *Tags* that have a ``numerical_value`` associated to them:
- <img src="/assets/images/misp/blog/decaying//tag-as-A.png" alt="admiraly-scale:source-reliability = Completely reliable" width="300"/>, ``numerical_value = 100``
- <img src="/assets/images/misp/blog/decaying//tag-as-D.png" alt="admiraly-scale:source-reliability = Not usually reliable" width="300"/>, ``numerical_value = 25``
- <img src="/assets/images/misp/blog/decaying//tag-p-H.png" alt="phishing:psychological-acceptability = high" width="250"/>, ``numerical_value = 75``

So, if an *Attribute* only have one *Tag* attached, let's say ``admiralty-scale:source-reliability="Completely reliable"``, the ``base_score`` would be:
```
base_score = 100
```

Weights come into action when multiple *Tags* are attached to an *Attribute*. To make things a bit easier, let's suppose that both *Taxonomies* should have the same importance in regards to the *Attribute*'s score. Thus, the total weight (100) will be shared, assigning both *Taxonomy* a weight of 50.

```
admiralty-scale = 50
phishing        = 50
---------------------
sum              100
```
If an *Attribute* has the *Tags* <img src="/assets/images/misp/blog/decaying/tag-as-A.png" alt="admiraly-scale:source-reliability = Completely reliable" width="300"/> and <img src="/assets/images/misp/blog/decaying/tag-p-H.png" alt="phishing:psychological-acceptability = high" width="250"/> attached, the computation steps would look like this:

![base_score computation steps](/assets/images/misp/blog/decaying/bs-computation-steps.png)

Thus, the ``base_score`` of this *Attribute* will be ``87.50``.


# Short tutorial

Now that we've seen the basic concepts, let's have a look at how MISP implements these components. For these examples, we are using the default [phishing model](https://github.com/MISP/misp-decaying-models/blob/master/models/phishing-model.json) model on a **test** *Event*.

## Practical integration in MISP

### Endpoint: ``events/view``

At the *Event* level, a new filtering button has been added to attach the real-time computed ``score`` of any *Attributes* that has been mapped to a *Model*.

<img src="/assets/images/misp/blog/decaying/dm-event.png" alt="Decaying Model index" width="700"/>

### Endpoint: ``attribute/restSearch``

The ``attribute/restSearch`` endpoint has been updated and now supports four new parameters to filter out expired *Attributes* or play with the different available models.
- ``includeDecayScore`` **[bool]**: Attach the real-time computed ``score`` of the *Attribute* along with *Model(s)* informations
- ``excludeDecayed`` **[bool]**: Filter out all expired IOC
- ``decayingModel`` **[list]**: List of *Model(s)*, which will be attached to the *Attribute*
- ``modelOverrides`` **[dict]**: JSON that can be used to on-the-fly modify *Model(s)* parameters

Example
```
// attribute/restSearch query that gets every `ip-src` attributes being tagged with tlp or phishing,
// not being expired,
// with a overriden model threshold of 30 for the two models with id 84 and 12.
{
    "type": "ip-src",
    "tags": ["tlp:%","phishing:%"],
    "includeDecayScore": 1,
    "excludeDecayed": 1,
    "modelOverrides": {
        "threshold": 30
    }
    "decayingModel": [84, 12],
}
```

## Default and Custom Models

In MISP, Some *Decaying Models* called **Default Models** will be supplied by default. Similarly to *Taxonomies*, *Galaxies* or *misp-objects*, *Decaying Models* will have their [own repository](https://github.com/MISP/misp-decaying-models) and will have the possibility to be updated directly from the UI via a single click. **Default Models** are available to everyone, meaning that they can been viewed and customized by any users having a presence on the MISP instance.

**Custom Models** are user-defined models that are shared to other users. However, if desired, they can be hidden by turning off the sharing flag, similarly to the *Tag Collection* feature.

## Decaying Fine Tuning Tool: Setting parameters and mapping model to *Attribute* types

When creating a new *Decaying Model*, setting a parameters and viewing its impact should be as easy and straight forward as possible. To do so, few widgets are shipped with the latest version of MISP.

### Customizing lifetime and decay speed parameters

<video src="/assets/images/misp/blog/decaying/dm-tool.mp4" title="Decaying Model Fine Tuning Tool - Parameters" width="800" height="450" controls autoplay loop>
  Your browser does not support the video tag.
</video>

### Setting the ``base_score``: Customizing Taxonomies' weight

<video src="/assets/images/misp/blog/decaying/dm-bs.mp4" type="video/mp4" title="Decaying Model Fine Tuning Tool - Base score" width="800" height="450" controls autoplay loop>
  Your browser does not support the video tag.
</video>

### Viewing scores and simulating the model

<video src="/assets/images/misp/blog/decaying/dm-simulation.mp4" type="video/mp4" title="Decaying Model Simulation Tool" width="800" height="450" controls autoplay loop>
  Your browser does not support the video tag.
</video>

# Developer perspective: Creating a model using a different algorithm

The Built-in Polynomial *Decaying Model* implemented in MISP allows any user to customize various components to achieve fine-grained decay behaviors. Still, it is possible that our model doesn't encompass your specific use-case. Thanks to the implemented architecture, any other formulas or algorithms can be added and used in a straightforward way.

Steps to create a new decay algorithm:
- Create a new file ``$filename`` in ``app/Model/DecayingModelsFormulas/``
- Extend the **Base** class ``DecayingModelBase``
- Implement the two functions ``computeScore`` and ``isDecayed`` with you own formula/algorithm
- Create a *Model* and set the ``formula`` field to ``$filename``

```
<?php
include_once 'Base.php';

class Polynomial extends DecayingModelBase
{
    public const DESCRIPTION = 'The description of your new decaying algorithm';

    public function computeScore($model, $attribute, $base_score, $elapsed_time)
    {
       // algorithm returning a numerical score
    }

    public function isDecayed($model, $attribute, $score)
    {
        // algorithm returning a boolean stating
        // if the attribute is expired or not
    }
}
?>
```

# Outcomes

Evaluating **quality** and **freshness** of IOCs is a problem commonly found in Threat Intelligence Platforms. We tried to solve it using a simple yet customizable system.

Upon release, MISP will be shipped with few models that could fit most use-cases. Still, we are eagerly waiting for contributions, fine-tunings or feedbacks from users. This would opens up plenty of opportunities including improved *Models*' precision, parameters tweaking or even integration of machine learning as a new *Model* algorithm.

Furthermore, we are not done yet! There are already improvements cooking in the MISP-Project oven,
- Integration of ``False Positive`` and ``Expiration`` *Sightings*
- Formula tweaking to provide better control on how to reset the ``base_score`` once a *Sighting* is created
- Per-user Taxonomies' ``numerical_value`` overrides
- Weights on *Tag*'s predicate level
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00			`---`
			`title: Decaying of Indicators - MISP improved model to expire indicators based on custom models`
			`layout: post`
chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			`featured: /assets/images/misp/blog/decay.png`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00			`---`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00			`# An improved and flexible model to expire indicators`

chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			`This blog post introduces the new functionality of MISP 2.4.116 which allows users and organisations to easily expire information depending on their objectives and targets.`

chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00			`MISP being a distributed system, various users and organisations are sharing data, sometimes without even knowing each others. While having access to a lot of information is extremely beneficial for all parties, it, however, also induces challenges to deal with.`

			`In this blog post, we will mainly discuss about information quality and freshness, other issues like trust, use-cases, interests, etc. are partially taken into account but will not be presented. Nevertheless, these concepts are examined in this [paper](https://arxiv.org/abs/1902.03914) along with a detailed explanation of the solution we have chosen to tackle these issues.`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
			`Our main objective is to provide users a simple yet customizable system to automatically (or manually) mark an Indicator Of Compromise (or more generic, an Attribute) as expired.`

			`Before getting started to show how the model presented in the paper is implemented in MISP, we first need to have a look at some concepts needed to better understand how components are working and tied together.`

			`# The (potentially) annoying bits of theory`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			The solution currently supported in MISP is based on two components: ``base_score`` and ``score``. The idea is to have an initial fixed value called ``base_score`` taking into account the quality of an indicator; and a time-dependant ``score``, which decreases the more time passes.

			`A simplified version would be something like this:`
			```
			`score = base_score * P`
			```
			Where ``P`` is composed of ``parameters``:
			- ``lifetime``: The lifetime of the IOC or the time at which the score of the Attribute's score will be 0
			- ``decay_speed``: The speed at which the decay happens or the speed at which an Attribute will loose score

			⚠ *It should be noted that everytime a [Sightings](https://www.circl.lu/doc/misp/sightings/) is added to an Attribute, the ``score`` is refresh to the ``base_score`` and a new decay is initiated from that point.*

			`# Polynomial Decaying Model built-in in MISP`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00
			We still have to see how the ``base_score`` is actually computed. In the built-in version of the Decaying Model in MISP, the ``base_score`` is computed from the Taxonomies and some weight. Weights are a mean to prioritize extracted ``numerical_values`` from Taxonomies over others.
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			To give the intuition of how the ``base_score`` computation works, let's look at two examples. In these examples, the two Taxonomies used are
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			[phishing](https://github.com/MISP/misp-taxonomies/blob/master/phishing/machinetag.json) and [admiralty-scale](https://github.com/MISP/misp-taxonomies/blob/master/admiralty-scale/machinetag.json). Both of them contain Tags that have a ``numerical_value`` associated to them:
chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			- <img src="/assets/images/misp/blog/decaying//tag-as-A.png" alt="admiraly-scale:source-reliability = Completely reliable" width="300"/>, ``numerical_value = 100``
			- <img src="/assets/images/misp/blog/decaying//tag-as-D.png" alt="admiraly-scale:source-reliability = Not usually reliable" width="300"/>, ``numerical_value = 25``
			- <img src="/assets/images/misp/blog/decaying//tag-p-H.png" alt="phishing:psychological-acceptability = high" width="250"/>, ``numerical_value = 75``
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
			So, if an Attribute only have one Tag attached, let's say ``admiralty-scale:source-reliability="Completely reliable"``, the ``base_score`` would be:
			```
			`base_score = 100`
			```

chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00			`Weights come into action when multiple Tags are attached to an Attribute. To make things a bit easier, let's suppose that both Taxonomies should have the same importance in regards to the Attribute's score. Thus, the total weight (100) will be shared, assigning both Taxonomy a weight of 50.`

chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			```
			`admiralty-scale = 50`
			`phishing = 50`
			`---------------------`
			`sum 100`
			```
chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			`If an Attribute has the Tags <img src="/assets/images/misp/blog/decaying/tag-as-A.png" alt="admiraly-scale:source-reliability = Completely reliable" width="300"/> and <img src="/assets/images/misp/blog/decaying/tag-p-H.png" alt="phishing:psychological-acceptability = high" width="250"/> attached, the computation steps would look like this:`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			`![base_score computation steps](/assets/images/misp/blog/decaying/bs-computation-steps.png)`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
			Thus, the ``base_score`` of this Attribute will be ``87.50``.


			`# Short tutorial`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00
			`Now that we've seen the basic concepts, let's have a look at how MISP implements these components. For these examples, we are using the default [phishing model](https://github.com/MISP/misp-decaying-models/blob/master/models/phishing-model.json) model on a test Event.`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
			`## Practical integration in MISP`

			### Endpoint: ``events/view``
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			At the Event level, a new filtering button has been added to attach the real-time computed ``score`` of any Attributes that has been mapped to a Model.

chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			`<img src="/assets/images/misp/blog/decaying/dm-event.png" alt="Decaying Model index" width="700"/>`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
			### Endpoint: ``attribute/restSearch``
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			The ``attribute/restSearch`` endpoint has been updated and now supports four new parameters to filter out expired Attributes or play with the different available models.
			- ``includeDecayScore`` [bool]: Attach the real-time computed ``score`` of the Attribute along with Model(s) informations
			- ``excludeDecayed`` [bool]: Filter out all expired IOC
			- ``decayingModel`` [list]: List of Model(s), which will be attached to the Attribute
			- ``modelOverrides`` [dict]: JSON that can be used to on-the-fly modify Model(s) parameters

			`Example`
			```
			// attribute/restSearch query that gets every `ip-src` attributes being tagged with tlp or phishing,
			`// not being expired,`
			`// with a overriden model threshold of 30 for the two models with id 84 and 12.`
			`{`
			`"type": "ip-src",`
			`"tags": ["tlp:%","phishing:%"],`
			`"includeDecayScore": 1,`
			`"excludeDecayed": 1,`
			`"modelOverrides": {`
			`"threshold": 30`
			`}`
			`"decayingModel": [84, 12],`
			`}`
			```

			`## Default and Custom Models`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			`In MISP, Some Decaying Models called Default Models will be supplied by default. Similarly to Taxonomies, Galaxies or misp-objects, Decaying Models will have their [own repository](https://github.com/MISP/misp-decaying-models) and will have the possibility to be updated directly from the UI via a single click. Default Models are available to everyone, meaning that they can been viewed and customized by any users having a presence on the MISP instance.`

			`Custom Models are user-defined models that are shared to other users. However, if desired, they can be hidden by turning off the sharing flag, similarly to the Tag Collection feature.`

			`## Decaying Fine Tuning Tool: Setting parameters and mapping model to Attribute types`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00
			`When creating a new Decaying Model, setting a parameters and viewing its impact should be as easy and straight forward as possible. To do so, few widgets are shipped with the latest version of MISP.`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
			`### Customizing lifetime and decay speed parameters`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00
chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			`<video src="/assets/images/misp/blog/decaying/dm-tool.mp4" title="Decaying Model Fine Tuning Tool - Parameters" width="800" height="450" controls autoplay loop>`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			`Your browser does not support the video tag.`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00			`</video>`

			### Setting the ``base_score``: Customizing Taxonomies' weight
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			`<video src="/assets/images/misp/blog/decaying/dm-bs.mp4" type="video/mp4" title="Decaying Model Fine Tuning Tool - Base score" width="800" height="450" controls autoplay loop>`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			`Your browser does not support the video tag.`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00			`</video>`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00			`### Viewing scores and simulating the model`

chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			`<video src="/assets/images/misp/blog/decaying/dm-simulation.mp4" type="video/mp4" title="Decaying Model Simulation Tool" width="800" height="450" controls autoplay loop>`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			`Your browser does not support the video tag.`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00			`</video>`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
			`# Developer perspective: Creating a model using a different algorithm`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			`The Built-in Polynomial Decaying Model implemented in MISP allows any user to customize various components to achieve fine-grained decay behaviors. Still, it is possible that our model doesn't encompass your specific use-case. Thanks to the implemented architecture, any other formulas or algorithms can be added and used in a straightforward way.`

			`Steps to create a new decay algorithm:`
			- Create a new file ``$filename`` in ``app/Model/DecayingModelsFormulas/``
			- Extend the Base class ``DecayingModelBase``
			- Implement the two functions ``computeScore`` and ``isDecayed`` with you own formula/algorithm
			- Create a Model and set the ``formula`` field to ``$filename``

			```
			`<?php`
			`include_once 'Base.php';`

			`class Polynomial extends DecayingModelBase`
			`{`
			`public const DESCRIPTION = 'The description of your new decaying algorithm';`

			`public function computeScore($model, $attribute, $base_score, $elapsed_time)`
			`{`
			`// algorithm returning a numerical score`
			`}`

			`public function isDecayed($model, $attribute, $score)`
			`{`
chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			`// algorithm returning a boolean stating`
			`// if the attribute is expired or not`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			`}`
			`}`
			`?>`
			```

			`# Outcomes`
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			`Evaluating quality and freshness of IOCs is a problem commonly found in Threat Intelligence Platforms. We tried to solve it using a simple yet customizable system.`

chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			`Upon release, MISP will be shipped with few models that could fit most use-cases. Still, we are eagerly waiting for contributions, fine-tunings or feedbacks from users. This would opens up plenty of opportunities including improved Models' precision, parameters tweaking or even integration of machine learning as a new Model algorithm.`
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00
			`Furthermore, we are not done yet! There are already improvements cooking in the MISP-Project oven,`
			- Integration of ``False Positive`` and ``Expiration`` Sightings
chg: [blog] post updated about decaying indicators 2019-09-10 15:11:01 +02:00			- Formula tweaking to provide better control on how to reset the ``base_score`` once a Sighting is created
chg: [blog] new blog post decaying indicators 2019-09-10 14:38:51 +02:00			- Per-user Taxonomies' ``numerical_value`` overrides
chg: [blog] decaying indicator blog post updated 2019-09-10 14:54:00 +02:00			`- Weights on Tag's predicate level`