New common check: Helper functions should be private #207

jiegillet · 2021-10-22T15:20:35Z

Closes #201.
This PR ended up being kind of huge, sorry about that.

The purpose of the new common check is to see if helper functions have been mistakenly defined as public functions. To do this, we can check against the exemploid (yes, I ended up using that word). So far, we could only read concept exercise exemplars, so I had to include practice exercises too. Then one thing led to another:

I added a %Source{} struct that holds all the exercise input (solution, exemploid, slug, paths...)
I changed the analyzer to read all that data before analysis and pass it all in an argument + added tests (in test_data for missing files etc)
I added the check for public helper functions and tests (~~I could only use test_data since we need an exemploid~~)
I fixed a bunch of other tests that had public helpers
I modified ExerciseTestCase so it could figure out the source info from the module name
I fixed a small bug: the compiler warnings showed the compiled file as nofile and that always bugged me. Now that the common checks have access to %Source{} it was easy to add the correct file name.
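
As a rough illustration of the idea behind the check (not the code in this PR), the comparison against the exemploid can be sketched as follows. The module name `PublicHelperSketch` and both function names are hypothetical, and the sketch assumes the solution and the exemploid are available as source strings:

```elixir
defmodule PublicHelperSketch do
  # Hypothetical helper, not the analyzer's real API: returns the
  # {name, arity} pairs of every function defined with `def` in `code`.
  def public_functions(code) do
    {_ast, acc} =
      code
      |> Code.string_to_quoted!()
      |> Macro.prewalk([], fn
        {:def, _meta, [head | _body]} = node, acc -> {node, [name_and_arity(head) | acc]}
        node, acc -> {node, acc}
      end)

    acc |> Enum.uniq() |> Enum.sort()
  end

  # Public functions in the solution that the reference does not define.
  def unexpected_public_functions(solution, exemploid) do
    public_functions(solution) -- public_functions(exemploid)
  end

  # Strip a guard, if any, then read the name and argument count.
  defp name_and_arity({:when, _meta, [head | _guards]}), do: name_and_arity(head)
  defp name_and_arity({name, _meta, args}) when is_list(args), do: {name, length(args)}
  defp name_and_arity({name, _meta, _no_args}), do: {name, 0}
end
```

This sketch walks the whole AST, so it also picks up functions in nested modules, and it counts a default argument as a plain argument. Both are simplifications made for brevity.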

Left to do

website-copy comment
Add tests for public helper functions, concept and practice.

I changed my mind on check_source, I think I would now prefer passing it the %Source{} so that users have access to everything. I will probably do that in a different PR, because this one is big enough. I also have to add docs.

angelikatyborska · 2021-10-26T16:24:35Z

The bigger the PR, the longer the wait until I have enough brain power for a review 🙈

angelikatyborska

I'm submitting a partial review, I'll continue another day.

I will probably do that in a different PR, because this one is big enough. I also have to add docs.

Yes please, separate PR 😅

lib/elixir_analyzer.ex

lib/elixir_analyzer/exercise_test/common_checks/compiler_warnings.ex

lib/elixir_analyzer/source.ex

test_data/two_fer/missing_example_solution/lib/two_fer.ex

jiegillet · 2021-10-27T02:37:15Z

lib/elixir_analyzer.ex

+    {exercice_type, exemploid_path} =
+      case meta_config["files"] do
+        %{"exemplar" => [path | _]} -> {:concept, Path.join(params.path, path)}
+        %{"example" => [path | _]} -> {:practice, Path.join(params.path, path)}


I wonder about these:

relative_code_path = meta_config["files"]["solution"] |> hd()
[...]
%{"exemplar" => [path | _]} -> {:concept, Path.join(params.path, path)}
%{"example" => [path | _]} -> {:practice, Path.join(params.path, path)}

After a quick check, it looks like all solution/example/exemplar values in all config.json only have one file in there. But that doesn't mean that we are capturing the full code. There are some exercises with the editor field, and students that submit solutions via the CLI may add more files.

I'll open an issue after this PR gets merged so we can form a plan.

@neenjaw do I remember correctly that your initial assumption was to just ignore multi file submissions? I think that was totally reasonable as they are very rare. It would be cool to at least get the basic common checks working for multi file solutions (e.g. proper name casing, indentation) but I wouldn't spend too much effort on making it perfect.

When I wrote that line of code, example files were only a single file. If that's not the case anymore, then it makes sense to change strategy.

Example files are still a single file, but the checks only look at the file mentioned in meta_config["files"]["solution"]. If students submit several files, the extra files are not looked at, and the student might actually get a compiler error as an analyzer comment, now that I think about it. Could you re-submit one of your multiple file solutions and check?

Compiled Elixir is still dynamically linked at run time, so code only needs to be legal syntax, not runnable, to compile successfully.

iex(1)> Code.compile_string("""
...(1)> defmodule MyModule do
...(1)>   def foo do
...(1)>     SomeNonExistentModule.bar(2)
...(1)>   end
...(1)> end
...(1)> """)
[
  {MyModule,
   <<70, 79, 82, 49, 0, 0, 4, 236, 66, 69, 65, 77, 65, 116, 85, 56, 0, 0, 0, 171,
     0, 0, 0, 16, 15, 69, 108, 105, 120, 105, 114, 46, 77, 121, 77, 111, 100,
     117, 108, 101, 8, 95, 95, 105, 110, 102, 111, ...>>}
]

That's true enough, although not foolproof.

iex(42)> Code.compile_string("""
...(42)> defmodule MyModule do
...(42)>   import SomeNonExistentModule
...(42)>   def foo do
...(42)>     bar(2)
...(42)>   end
...(42)> end
...(42)> """ )
** (CompileError) nofile:2: module SomeNonExistentModule is not loaded and could not be found

jiegillet · 2021-10-27T03:02:17Z

The bigger the PR, the longer the wait until I have enough brain power for a review 🙈

Same here, it's all good, take your time this isn't urgent.

lib/elixir_analyzer/exercise_test/common_checks/private_helper_functions.ex

test/elixir_analyzer/exercise_test/common_checks/private_helper_functions_test.exs

lib/elixir_analyzer/exercise_test/common_checks/private_helper_functions.ex

test/elixir_analyzer/test_suite/freelancer_rates_test.exs

angelikatyborska · 2021-10-31T14:53:08Z

Something is weird with CI. I tried merging main to see if it helps but it didn't:

== Compilation error in file test/elixir_analyzer/exercise_test/assert_call/erlang_modules_test.exs ==
** (File.Error) could not list directory "elixir/exercises/concept": no such file or directory
    (elixir 1.12.1) lib/file.ex:1590: File.ls!/1
    (elixir_analyzer 0.1.0) test/support/exercise_test_case.ex:181: ElixirAnalyzer.ExerciseTestCase.find_source_type/1
    (elixir_analyzer 0.1.0) test/support/exercise_test_case.ex:166: ElixirAnalyzer.ExerciseTestCase.find_source/1
    test/elixir_analyzer/exercise_test/assert_call/erlang_modules_test.exs:2: (module)
    (stdlib 3.15) erl_eval.erl:685: :erl_eval.do_apply/6
    (elixir 1.12.1) lib/kernel/parallel_compiler.ex:428: Kernel.ParallelCompiler.require_file/2
    (elixir 1.12.1) lib/kernel/parallel_compiler.ex:321: anonymous fn/4 in Kernel.ParallelCompiler.spawn_workers/7

jiegillet · 2021-11-01T08:50:20Z

Something is weird with CI. I tried merging main to see if it helps but it didn't:

I rebased everything on main, and it seems to work. Or was it you? 🤷‍♂️

angelikatyborska · 2021-11-01T18:49:35Z

@jiegillet It doesn't work, see failing external tests: https://github.com/exercism/elixir-analyzer/runs/4065745282?check_suite_focus=true

jiegillet · 2021-11-02T00:57:32Z

Oh you're right. I changed the CI for the internal tests, but I forgot the external tests. I guess the external tests don't need the submodule, but compilation does. It wasn't an issue so far because if the concept exercise folder wasn't found, it assumed it was a practice exercise and moved on. However, now it's required.

test/elixir_analyzer/exercise_test/common_checks/private_helper_functions_test.exs

angelikatyborska · 2021-11-02T18:40:37Z

I opened a PR with the comment change because I forgot you said you will do it 🤦 exercism/website-copy#2118

I left two final suggestions about the new check. Everything else looks great. I didn't review the refactoring with my regular care and attention to detail - I trust the tests to catch bugs and I trust you that it makes sense conceptually because by now you know this project better than I do 😁

angelikatyborska · 2021-11-02T19:00:56Z

Ok, after reviewing the elixir repo PR, I am now wondering what about extra modules like this:

defmodule Rules do
  defmodule BooleanLogic do
    def do_or(left, right), do: left or right
  end

  def score?(touching_power_pellet, touching_dot) do
    BooleanLogic.do_or(touching_power_pellet, touching_dot)
  end
end

This will trigger a comment. So we will effectively be complaining about any multi-module solution 🤔

jiegillet · 2021-11-03T01:18:50Z

Ok, after reviewing the elixir repo PR, I am now wondering what about extra modules like this:
defmodule Rules do
  defmodule BooleanLogic do
    def do_or(left, right), do: left or right
  end

  def score?(touching_power_pellet, touching_dot) do
    BooleanLogic.do_or(touching_power_pellet, touching_dot)
  end
end
This will trigger a comment. So we will effectively be complaining about any multi-module solution 🤔

That's true. How about then I detect the @doc false trick in my implementation?
We already comment about that trick in the analyzer comment, we could change it slightly to mention that this is a viable option if they want to have multi-module solutions. I haven't reviewed your website-copy PR yet, in case we want to do this.

jiegillet · 2021-11-03T05:49:37Z

I ended up hiding functions behind @doc false and @impl true. If you think it's not necessary, I can revert the last commit.

neenjaw · 2021-11-04T06:38:29Z

Ok, I'm missing some details about this PR.

Why does @doc false and @impl true hide functions?

Doesn't this PR only check that the solution module exposes just the public functions required by the stub, with the rest private? (I haven't inspected the details, but that is the issue this PR intends to close, correct?)

Why would we also throw a warning about a sub module?

jiegillet · 2021-11-04T07:53:01Z

Doesn't this PR only check that the solution module exposes just the public functions required by the stub, with the rest private? (I haven't inspected the details, but that is the issue this PR intends to close, correct?)

Yes, that is the intent, but read on.

Why does @doc false and @impl true hide functions?

The basic rule is that the students should only expose functions required in the tests, and everything else should be private. However in general, there are 2 situations where that's not possible:

When using GenServer, they need to implement functions like init which are not in the tests but cannot be defined privately. However, if those functions have @impl true before, it signals that this is an implementation of another module, and effectively hides the function from the public interface as well.
When using different modules, some functions must be public to be used across modules, so to hide them from the docs and the public interface you use @doc false.

I basically want to give students the same options. For the first point, the exemplar/example (which is what we use as a reference) also uses those functions, so it would be OK, but for the second point, if we want to let users define their own modules, we need to give them a way to do that without triggering the check, so we need to let them know about @doc false and take it into consideration.
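
To make the two situations concrete, here is a minimal sketch. The modules `Counter` and `Counter.Format` are hypothetical and are not taken from any exercise:

```elixir
defmodule Counter do
  use GenServer

  # Public API that the exercise tests would call.
  def start_link(initial), do: GenServer.start_link(__MODULE__, initial)
  def increment(pid), do: GenServer.call(pid, :increment)

  # Callbacks cannot be private: GenServer calls them from outside
  # this module. `@impl true` marks them as behaviour implementations.
  @impl true
  def init(initial), do: {:ok, initial}

  @impl true
  def handle_call(:increment, _from, count), do: {:reply, count + 1, count + 1}
end

defmodule Counter.Format do
  # Must be public to be callable from another module; `@doc false`
  # only hides it from generated documentation.
  @doc false
  def label(count), do: "count: #{count}"
end
```

Note that `function_exported?(Counter.Format, :label, 1)` still returns `true`: `@doc false` changes what the documentation shows, not what other modules can call.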

Why would we also throw a warning about a sub module?

I'm not sure what you are referring to here, we have no plans of telling students about the sub module.

neenjaw · 2021-11-04T12:57:06Z

Why does @doc false and @impl true hide functions?

The basic rule is that the students should only expose functions required in the tests, and everything else should be private. However in general, there are 2 situations where that's not possible:

When using GenServer, they need to implement functions like init which are not in the tests but cannot be defined privately. However, if those functions have @impl true before, it signals that this is an implementation of another module, and effectively hides the function from the public interface as well.

If GenServer is the only case we are aware of where this applies in a main solution module, why not just make exceptions for functions named to match the GenServer callback functions?

While @impl true is recommended and will throw a warning if one exists and others are missing, it is not always required.

When using different modules, some functions must be public to be used across modules, so to hide them from the docs and the public interface you use @doc false.

I basically want to give students the same options. For the first point, the exemplar/example (which is what we use as a reference) also uses those functions, so it would be OK, but for the second point, if we want to let users define their own modules, we need to give them a way to do that without triggering the check, so we need to let them know about @doc false and take it into consideration.

So even in your discussion above, you called it the @doc false trick. While this prevents the function from appearing in ExDoc-like instances, it's not really hidden from the public interface.

I'm not even sure it is a good suggestion to use @doc false on sub modules, because it may suggest to students that any sub module should not have documentation.

It is an unfortunate/fortunate consequence of modules always having public visibility in Elixir, so I think that's why we should try to teach the good habit of organizing code around contexts rather than tricks to avoid our false positive warning.

Why would we also throw a warning about a sub module?

I'm not sure what you are referring to here, we have no plans of telling students about the sub module.

I'm referring to the exchange that ends with your comment here: #207 (comment) where submodules will raise a warning unless covered by the trick.

I think I would suggest that this public/private check only exist for the main solution module, with exceptions for known public functions which may be implemented broadly (e.g. GenServer callback named functions), rather than for any module, where I think our authority to appropriately know the scope of functions is weaker.
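
One way to picture this suggestion is to look only at `def`s that sit directly in the outermost module. The following is a sketch under the assumption that the file contains a single top-level `defmodule`; the module name `TopLevelOnlySketch` is hypothetical and this is not the analyzer's implementation:

```elixir
defmodule TopLevelOnlySketch do
  # Hypothetical helper: lists {name, arity} for `def`s that sit directly
  # in the outermost module, skipping nested `defmodule` blocks entirely.
  def public_functions(code) do
    {:defmodule, _meta, [_alias, [do: body]]} = Code.string_to_quoted!(code)

    body
    |> statements()
    |> Enum.flat_map(fn
      {:def, _meta, [head | _body]} -> [name_and_arity(head)]
      _other -> []
    end)
  end

  # A module body is a `__block__` when it holds several expressions.
  defp statements({:__block__, _meta, list}), do: list
  defp statements(single), do: [single]

  # Strip a guard, if any, then read the name and argument count.
  defp name_and_arity({:when, _meta, [head | _guards]}), do: name_and_arity(head)
  defp name_and_arity({name, _meta, args}) when is_list(args), do: {name, length(args)}
  defp name_and_arity({name, _meta, _no_args}), do: {name, 0}
end
```

Applied to the `Rules` example earlier in this thread, the sketch reports only `score?/2` and never sees `BooleanLogic.do_or/2`, so a multi-module solution would not trigger the comment.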

jiegillet · 2021-11-04T13:17:17Z

Ah, I see, I confused Elixir sub-modules with the git submodule. Those things only have the name in common :)

For the GenServer case, it should not be a problem at all to drop the @impl true check, since the reference implementation would use those same functions as well.

So you are suggesting that we do not check other modules at all. It certainly can be done, but it kind of undermines the message of the check, which is to use as few public functions as possible. As you said, it's a consequence of having always-public modules.

neenjaw · 2021-11-04T13:32:27Z

So you are suggesting that we do not check other modules at all. It certainly can be done, but it kind of undermines the message of the check, which is to use as few public functions as possible. As you said, it's a consequence of having always-public modules.

Yes, that is my suggestion, not because the principle doesn't stand or because it isn't a good practice, but because our ability to differentiate required interface from private helper in unknown, student written, modules is probably not trivial -- and in those situations I think it is best left to the mentor rather than raising a false positive comment or an innocent, but misleading, comment to avoid our false-positive warning.

If you can introspect to keep track of what calls are made from the solution module to student written auxiliary modules, then perhaps you can be more certain to raise the comment only when there exists a public function, not called by an external module which then there is a stronger indication that it should be private because it is only used internally.

jiegillet · 2021-11-04T15:17:52Z

If you can introspect to keep track of what calls are made from the solution module to student written auxiliary modules, then perhaps you can be more certain to raise the comment only when there exists a public function, not called by an external module which then there is a stronger indication that it should be private because it is only used internally.

Nope, not doing that, it's not worth it 😅

This reverts commit 6e5c3a7.

jiegillet · 2021-11-06T03:19:06Z

I've implemented your suggestion in the last commit.

neenjaw · 2021-11-06T03:21:02Z

mid-review

neenjaw · 2021-11-06T03:32:50Z

Changes look reasonable. Adding this one test seemed to impact a lot of test files, which is interesting. I think that while this is a more conservative approach, it will yield fewer false-positive warnings.

I'll leave the comment copy and merge to you and @angelikatyborska since she is code-owner and has to approve to merge

jiegillet · 2021-11-06T04:24:34Z

Thanks for the review and the suggestions, I appreciate it.

angelikatyborska

I agree with the conclusion to skip this check for modules other than the one with the solution.

Waiting to merge until exercism/website-copy#2118 gets merged.

jiegillet added 7 commits October 21, 2021 20:51

Implement check

479cb4e

Add Source to read and hold exemploid files

d37101c

Add tests for missing/corrupted example files

03e2c8a

Send proper file name to compiler warnings

4bb879a

Read practice exercice example files in test cases

0d8d91a

Add tests, fix old tests with public helpers

3c253f1

update submodule

d71693f

jiegillet added the x:module/analyzer, x:type/coding, x:size/large, and hacktoberfest-accepted labels Oct 22, 2021

jiegillet requested a review from angelikatyborska as a code owner October 22, 2021 15:20

mix format and mix credo

598e783

jiegillet mentioned this pull request Oct 26, 2021

Add Elixir analyzer comment: private helper functions exercism/website-copy#2112

Merged

angelikatyborska reviewed Oct 26, 2021

jiegillet commented Oct 27, 2021

jiegillet and others added 4 commits October 27, 2021 22:13

Typos and formatting

2b562d4

Refactor, remove undocumented :file and :module parameters

62255c8

Add tests for private helpers

6c54a27

Merge branch 'main' into jie-private-helpers

eadbd8b

angelikatyborska reviewed Oct 31, 2021

jiegillet added 2 commits November 1, 2021 17:45

Change check to consider arity

cb3af04

Rework check test to show comment details

c53427d

Update externel test CI and submodule

bec9589

angelikatyborska reviewed Nov 2, 2021

test/elixir_analyzer/exercise_test/common_checks/private_helper_functions_test.exs (outdated)

angelikatyborska reviewed Nov 2, 2021

test/elixir_analyzer/exercise_test/common_checks/private_helper_functions_test.exs (outdated)

angelikatyborska reviewed Nov 2, 2021

test/elixir_analyzer/exercise_test/common_checks/private_helper_functions_test.exs

jiegillet added 4 commits November 3, 2021 10:20

typo

6ad53bd

Update submodule with cleaned up exemploids

7de9006

Arguments to _, show first failure

4667a77

Hide functions after @doc false or @impl true

6e5c3a7

jiegillet added 2 commits November 5, 2021 21:38

Revert "Hide functions after @doc false or @impl true"

6584333

This reverts commit 6e5c3a7.

Annotate AST to keep track of modules

d6d9397

neenjaw approved these changes Nov 6, 2021

angelikatyborska approved these changes Nov 6, 2021

jiegillet merged commit 46ba1b4 into exercism:main Nov 6, 2021

jiegillet deleted the jie-private-helpers branch November 6, 2021 13:14

This was referenced Jul 14, 2022

Document reputation label exercism/docs#347

Merged

Consider correct reputation amounts for different labels exercism/exercism#6440

Open

Consider correct reputation amounts for different labels exercism/exercism#6441

Closed
