Add network locality fields #288

andrewkroh · 2018-12-18T06:57:08Z

This adds network.locality that simply has a value of either private or public. If either source.ip or destination.ip are public IP addresses then network.locality is elevated to "public". Otherwise if both source.ip and destination.ip are non-public then network.locality is private.

The IPv4 and IPv6 ranges that are considered private are strictly specified in the definition of network.locality.

This is a useful means of filtering on flows. Some common queries I use are

network.locality:public and source.locality:public
network.locality:public and destination.locality:public

MikePaquette

@andrewkroh this seems useful - I have a few questions please:

If a user has available a list of internally managed IP addresses, even if routable, (e.g., via Infoblox) should those addresses be considered private as well? or just the ones specified by RFC's listed.? (I think this could be the distinction between "internal" and "private")
If private == internal, then could this information be represented in the network.direction field? It seems that:
network.direction: external == source.locality: public and destination.locality: public
network.direction: internal == source.locality: private and destination.locality: private
(but I think "inbound" and "outbound" will get us back into the troubling discussion about how to determine/define this.)
In your example, isn't network.locality:public and source.locality:public == source.locality:public ?
If we go forward with these fields, should we also have client.locality and server.locality for performing similar filtering on bidirectional flows? If so, then we'd have to decide which one populates the network.locality field in the case where they conflict. (i.e. when source.* <> client.*)

andrewkroh · 2019-02-23T02:57:31Z

If a user has available a list of internally managed IP addresses, even if routable, (e.g., via Infoblox) should those addresses be considered private as well? or just the ones specified by RFC's listed.? (I think this could be the distinction between "internal" and "private")

I'd like for this to strictly be based on the RFCs which makes it very easy to implement everywhere since it doesn't require much configuration.

If private == internal, then could this information be represented in the network.direction field?

What constitutes "internal" and "external" has some flexibility which is great. If someone treats internal as the RFC private addresses then yes, they are equivalent. You get a difference when if you configure your "internal" ranges to be a subset of the private address space like only 10.20.0.0/16 and you have outbound traffic to some other private address. You also get a difference if you have a mix of public and private addresses that you treat as "internal".

In your example, isn't network.locality:public and source.locality:public == source.locality:public ?

Indeed it is, my bad. I was trying to show that you could infer some directionality based on the query and meant to use a private in there. Like network.locality:public and source.locality:private is probably a flow initiated from the inside to some public facing internet service.

... If so, then we'd have to decide which one populates the network.locality field in the case where they conflict. (i.e. when source.* <> client.*)

I agree that we should decide on a precedence. I'm just have trouble imagining cases where the two sets of client/server and source/destination addresses are different (maybe DHCP). As long as client/server and source/destination are the same sets of addresses the computation for network.locality always works out the same.

Let's say that source/destination take precedence?

This adds `network.locality` that simply has a value of either private or public. If either `source.ip` or `destination.ip` are public IP addresses then network.locality is elevated to "public". Otherwise if both `source.ip` and `destination.ip` are non-public then `network.locality` is private. The IPv4 and IPv6 ranges that are considered private are strictly specified in the definition of `network.locality`. This is a useful means of filtering on flows. Some common queries I use are - network.locality:public and source.locality:public - network.locality:public and destination.locality:public

andrewkroh · 2019-03-08T02:49:10Z

I've changed my thinking on this after seeing a user request to make the network ranges configurable. I'm thinking it's better to keep the source/destination.locality fields in sync with the network.direction field.

source.locality	destination.locality	network.locality
internal	internal	internal
internal	external	outbound
external	internal	inbound
external	external	external
		unknown

You can see how this would be used in elastic/beats#11147.

andrewkroh · 2019-03-15T16:28:59Z

The general idea behind having both a network.direction and a network.locality is to allow monitoring tools to report the direction that they observed (like Zeek running on a desktop) and to also be able to classify that traffic at a higher level (like classifying the traffic against all company owned networks).

For example, a server connects to another server on the company's network. A network monitoring tool running on that server sets network.direction: outbound for that connection. The network.locality field would be set to internal since both the source and destination are internal to the company's network.

webmat

@andrewkroh @MikePaquette I like this proposal, and I'd like to get this in soon.

Andrew, is the PR still in line with your latest thinking, as described in comments from March and your proposal in Beats?

I can help rebase/merge the PR if you'd like. Much has changed in the last little while.

If you attempt the merge yourself, you'll have to add a short: description as well

webmat · 2019-04-23T20:59:04Z

CHANGELOG.md

@@ -20,6 +20,7 @@ All notable changes to this project will be documented in this file based on the

 * Added pointer in description of `http` field set to `url` field set. #330
 * Added an optional short field description. #330
+* Add `network.locality`, `source.locality`, and `desination.locality`. #288


There's a typo in destination, and please add the new fields as well, under client & server

andrewkroh · 2019-04-24T21:06:39Z

Andrew, is the PR still in line with your latest thinking, as described in comments from March and your proposal in Beats?

Yes, it is. And if you want to take over that would be much appreciated.

webmat · 2019-04-25T14:11:12Z

You got it! 👍

webmat · 2019-04-29T18:53:59Z

@andrewkroh I have a branch where the code of this PR has been ported on top of master.

However looking back at the later comments here, I realize that the PR code uses values "private" and "public", whereas this table and issue elastic/beats#11147 both talk about values "internal" and "external" for network.locality, source.locality and destination.locality...

Another inconsistency between the PR and the discussion is that the PR states that the values for locality should be based on the RFCs, but the thinking seems to be that they should be configurable, by users based on their network address ranges.

So I have a few things I'd like to discuss:

Which pairs of values should we use for the 5 locality fields?
- private/public
- internal/external
Should we separate the concept of RCF private/public from the concept of whether an IP range (public or not) is under the user's control?
If we introduce the concept of configurability wrt one's network for these fields, people will want levels of trust of these "trusted" ranges, as well. I've had this discussion with a few people already.

I do see the value in having the configurability. But my take is that starting with RFC & no configurability keeps it straightforward and useful, while avoiding the opening of pandora's box of "how much is the network trusted".

In other words, my answer to Q 2 would be we separate the concepts for now, and when tackling the configurability, we also tackle the trust levels.

I'm curious what you think about this.

dainperkins · 2019-08-27T01:30:52Z

some thoughts:

public/private are specific concepts and should be used as such. Internal / external are significantly more relevant in terms of security, apm, etc. but will, at the very least require the definition and lookup of internal public addresses, and external private addresses (assume all undefined are private/internal or public/external)
absolutely, they are not mutually inclusive or exclusive (at any level, plenty of vpn B2B use private addresses on each side)
network.risk.score, network.risk.tag :)

additionally, looking at e.g. NAPM having some concepts of asset type categorization of network zones would be extremely useful (network.tag or similar would be useful for describing network "subnets" in relation to physical locations in the organization, but network.location (maybe internal geoip additions? or pipelines to track ips to a network name/cidr index)

an array for network.tag could hold any of the 4 (pub/prv/int/ext), but a network.location could also hold organizational data identifying a specific location (Houston-DMZ, Data-Center, WAN-Hub, GCloud-Project, etc)

Add one for source & destination and suddenly theres the possibility of n/apm (adding network info to APM information for troubleshooting - e.g. User -> (fw to WAF) -> (WAF to array of web Servers).. track the # of resets/retransmits, or QOS info across the various WEB servers to troubleshoot issues at the network level...)

andrewkroh added the review label Dec 18, 2018

adriansr mentioned this pull request Dec 18, 2018

[Meta] Filebeat NetFlow input elastic/beats#9399

Closed

7 tasks

ruflin requested review from robgil and MikePaquette December 18, 2018 09:11

MikePaquette reviewed Dec 18, 2018

View reviewed changes

andrewkroh added 2 commits February 22, 2019 22:00

Add client.locality and server.locality

b7895d1

andrewkroh force-pushed the feature-network-locality branch from 098d373 to b7895d1 Compare February 23, 2019 03:07

webmat reviewed Apr 23, 2019

View reviewed changes

andrewkroh closed this Oct 21, 2020

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add network locality fields #288

Add network locality fields #288

andrewkroh commented Dec 18, 2018 •

edited

Loading

MikePaquette left a comment

andrewkroh commented Feb 23, 2019 •

edited

Loading

andrewkroh commented Mar 8, 2019 •

edited

Loading

andrewkroh commented Mar 15, 2019

webmat left a comment

webmat Apr 23, 2019

andrewkroh commented Apr 24, 2019

webmat commented Apr 25, 2019

webmat commented Apr 29, 2019

dainperkins commented Aug 27, 2019

Add network locality fields #288

Add network locality fields #288

Conversation

andrewkroh commented Dec 18, 2018 • edited Loading

MikePaquette left a comment

Choose a reason for hiding this comment

andrewkroh commented Feb 23, 2019 • edited Loading

andrewkroh commented Mar 8, 2019 • edited Loading

andrewkroh commented Mar 15, 2019

webmat left a comment

Choose a reason for hiding this comment

webmat Apr 23, 2019

Choose a reason for hiding this comment

andrewkroh commented Apr 24, 2019

webmat commented Apr 25, 2019

webmat commented Apr 29, 2019

dainperkins commented Aug 27, 2019

andrewkroh commented Dec 18, 2018 •

edited

Loading

andrewkroh commented Feb 23, 2019 •

edited

Loading

andrewkroh commented Mar 8, 2019 •

edited

Loading