Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Adds same glob patterns as exclusions for pageview goals #750

Merged
merged 9 commits into from
Mar 2, 2021
17 changes: 11 additions & 6 deletions lib/plausible/goal/schema.ex
Original file line number Diff line number Diff line change
Expand Up @@ -18,17 +18,22 @@ defmodule Plausible.Goal do
end

defp validate_event_name_and_page_path(changeset) do
if present?(changeset, :event_name) || present?(changeset, :page_path) do
if validate_page_path(changeset) || validate_event_name(changeset) do
changeset
else
changeset
|> add_error(:event_name, "this field is required")
|> add_error(:page_path, "this field is required")
|> add_error(:event_name, "this field is required and cannot be blank")
|> add_error(:page_path, "this field is required and must start with a /")
end
end

defp present?(changeset, field) do
value = get_field(changeset, field)
value && value != ""
defp validate_page_path(changeset) do
value = get_field(changeset, :page_path)
value && String.match?(value, ~r/^\/.*/)
Copy link
Contributor Author

@Vigasaurus Vigasaurus Feb 26, 2021

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'd like to do better validation of pathnames here, and add good validation for preventing duplicate goals; but I didn't want to make a disgusting regex if you knew of an easy Elixir helper for validating URLs (this may not work I guess because of the globs), and I couldn't get the latter to work.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I think it's OK as is.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

What do you think about it to extract this regex and put in the module attribute? And use a better name to describe the intentions of what this regex do...

end

defp validate_event_name(changeset) do
value = get_field(changeset, :event_name)
value && String.match?(value, ~r/^.+/)
end
end
41 changes: 27 additions & 14 deletions lib/plausible/stats/clickhouse.ex
Original file line number Diff line number Diff line change
Expand Up @@ -863,24 +863,37 @@ defmodule Plausible.Stats.Clickhouse do
end

defp fetch_pageview_goals(goals, site, query) do
pages =
goals =
Enum.map(goals, fn goal -> goal.page_path end)
|> Enum.filter(& &1)

if Enum.count(pages) > 0 do
q =
from(
e in base_query_w_sessions(site, query),
where: fragment("? IN tuple(?)", e.pathname, ^pages),
group_by: e.pathname,
select: %{
name: fragment("concat('Visit ', ?) as name", e.pathname),
count: fragment("uniq(user_id) as count"),
total_count: fragment("count(*) as total_count")
}
)
if Enum.count(goals) > 0 do
regex_goals =
Enum.map(goals, fn g ->
"(^#{g}\/?$)"
Vigasaurus marked this conversation as resolved.
Show resolved Hide resolved
|> String.replace(~r/\*\*/, ".*")
|> String.replace(~r/(?<!\.)\*/, "[^/]*")
end)

ClickhouseRepo.all(q)
from(
e in base_query_w_sessions(site, query),
where:
fragment(
"notEmpty(multiMatchAllIndices(?, array(?)) as indices)",
Vigasaurus marked this conversation as resolved.
Show resolved Hide resolved
e.pathname,
^regex_goals
),
select: %{
index: fragment("arrayJoin(indices) as index"),
count: fragment("uniq(user_id) as count"),
total_count: fragment("count(*) as total_count")
},
group_by: fragment("index")
)
|> ClickhouseRepo.all()
|> Enum.map(fn x ->
Map.put(x, :name, "Visit #{Enum.at(goals, x[:index] - 1)}") |> Map.delete(:index)
end)
else
[]
end
Expand Down