-
Notifications
You must be signed in to change notification settings - Fork 0
fire counts should be by day #2
Comments
We'll store the full counts in the database; when do we the aggregation we can set anything above 1 down to 1 to get a count of days that had fires. This is more flexible, as we always have the "how many were counted" information around. 1 and 0 beats true and false since we can |
This is really interesting. The current code may actually reflect the data Let A be the event of brightness >= 300K, let B be the event that On Wed, Jan 25, 2012 at 5:20 PM, robinkraft <
|
Interesting ... So currently the period aggregation adds up all the fires that happened in a given period. Each detection is stored separately, and a detection's rule attributes (>=330, >=50, both, no filter) add to the period's counts for each of these rules. Does this reflect what you're describing above? Aggregating 1/0 by day before aggregating by period is similar, but without explicit double counting. Yes a fire could be detected at 11pm and 1am and get counted on different days, but that seems less likely. We could test this. I guess the question is: Do we care about double counting? Or put another way, does knowing that a fire was detected more than once on the same day add meaningful information to the system? It's possible that multiple detections in a day reflect multiple fires within a 1km pixel, but to me it seems that the intensity and duration of fires is more meaningful than the raw count. Here's the aggregation function for your reference. https://github.com/sritchie/forma-clj/blob/develop/src/clj/forma/schema.clj#L106 |
Thanks. I think, for now, this reflects what we'd like to do. There are a On Wed, Jan 25, 2012 at 6:12 PM, robinkraft <
|
Is this still an issue? |
It's tabled for now, but it's something we might want to revisit. I'm not convinced raw fire counts are the right way to go, but at the end of the day if 500m data look good then I guess it doesn't matter. That said, the other day I learned about a 250m fires product that's supposed to be coming out soon. It's based on MODIS thermal anomalies but has some additional secret sauce. On Feb 4, 2012, at 3:55 PM, Sam Ritchie wrote:
|
@robinkraft is this still an issue? should we reframe, given today's convo? |
I think it's still an issue, even if we use additional info on fires like recentness (#125). There's no way to distinguish fires detected twice vs. separate fires the same day. I recommend switching to simply removing duplicate fire days. |
The fires data are combined from Terra and Aqua. For a given pixel and fire, the fire could be included twice or more in the same day if both sensors picked it up. Indeed, in some places the same sensor might pass over twice - once during the day, once at night. Additional detections don't necessarily represent additional fires, so it seems best to have booleans for the different fire filter rules for a given day.
Corner case: a fire is picked up by Terra with 335k and 35 confidence. It's picked up by Aqua with 329k and 54 confidence. So we need to decide if we should count ANY detection above the thresholds as a detection for that day, or whether we only count a fire day if there is a fire that meets the threshold of interest. In the above case, for one fire we'd get a hit on the >=330k filter, and one for the >= 50 confidence filter, but no hit for the >=330k AND >= 50 conf case.
I vote for having a boolean be flipped to true for a given day as each criteria is met, even if by "different" fire detections. But I don't have a theoretical basis for thinking that, it just feels more right.
The text was updated successfully, but these errors were encountered: