Add Test Codes #54

RexWzh · 2024-12-19T09:44:08Z

Add Test Codes

Hi @lenianiva, could you review the test setup in this PR?

Changes Made

~~The other PR's tests passed successfully.~~ Some of the statements need fixing; check the LSP here, as commented before.
Added the pytest framework and migrated the unit tests from server.py to test_server.py.

You can run the test suite with:

pytest -s tests/

Next Steps

~~Add GitHub workflow for automated testing~~
Fix the bugs that appear in the action; they can be checked with pytest tests -m error

Could we fix the error cases in a new PR to simplify the review process?

Please let me know if you'd like any further adjustments.

lenianiva · 2024-12-20T07:16:35Z

I'm a bit concerned about adding minif2f to the unit testing pipeline. Building mathlib takes a while. If it were just mathlib we could run lake exe cache get, but Pantograph has to work in a mathlib-independent way. How long does it take to build the library?

Can you set it up so that instead of using an example skeleton project with mathlib as a dependency, the minif2f tests directly run with project root pointing to Mathlib? This way we can leverage lake exe cache get and it would run much faster.
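A minimal sketch of the cache step suggested above, assuming a local Mathlib checkout and `lake` on the PATH. The names `fetch_mathlib_cache`, `mathlib_root`, and `dry_run` are hypothetical and not part of this PR; the sketch only wraps the `lake exe cache get` command and leaves out how the test server is then pointed at that directory.

```python
import subprocess
from pathlib import Path


def cache_get_command() -> list[str]:
    """Return the argv for fetching Mathlib's prebuilt cache."""
    return ["lake", "exe", "cache", "get"]


def fetch_mathlib_cache(mathlib_root: str, dry_run: bool = False) -> Path:
    """Run `lake exe cache get` inside the Mathlib checkout.

    `mathlib_root` is a hypothetical path to a Mathlib checkout. With
    dry_run=True the command is skipped and only the path is validated.
    """
    root = Path(mathlib_root).resolve()
    if not root.is_dir():
        raise FileNotFoundError(f"Mathlib root not found: {root}")
    if not dry_run:
        # check=True raises CalledProcessError if lake exits non-zero
        subprocess.run(cache_get_command(), cwd=root, check=True)
    return root
```

A session-scoped pytest fixture could call `fetch_mathlib_cache` once before the minif2f tests start, so the cache download is not repeated per test.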

RexWzh · 2024-12-20T20:27:55Z

> Can you set it up so that instead of using an example skeleton project with mathlib as a dependency, the minif2f tests directly run with project root pointing to Mathlib? This way we can leverage lake exe cache get and it would run much faster.

You could approve this PR to run the actions, or check the results here: https://github.com/Lean-zh/PyPantograph/pull/2/checks

RexWzh · 2024-12-20T20:29:52Z

tests/test_minif2f.py

+@pytest.mark.advance
+def test_load_theorem(minif2f_server: Server, minif2f_test: DataFrame, minif2f_valid: DataFrame):
+    """Comprehensive test for loading multiple theorems.
+    use pytest -m "not advance" to skip this test.

Use pytest -m "not advance" to skip these tests if they take too long.
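For the `-m` selection to work without unknown-marker warnings, the custom marks have to be registered somewhere. The PR may already do this (for example in pyproject.toml); the following is only a sketch of one way to do it from a `conftest.py`, with the marker descriptions being assumed wording rather than text from the PR.

```python
# conftest.py (sketch; the descriptions below are assumptions)

MARKERS = {
    "advance": "slow, comprehensive tests; skip with -m 'not advance'",
    "error": "tests expected to fail; select with -m error",
}


def pytest_configure(config):
    """Register the custom markers so pytest recognizes them."""
    for name, description in MARKERS.items():
        # "markers" is the ini option pytest reads marker definitions from
        config.addinivalue_line("markers", f"{name}: {description}")
```

With the marks registered, `pytest -m "not advance" tests/` runs everything except the slow tests, and `pytest -m error tests/` runs only the known-failing ones.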

lenianiva · 2025-01-11T18:32:16Z

Can I push onto your dev branch?

RexWzh · 2025-01-12T07:13:57Z

> Can I push onto your dev branch?

Yes, please, I think you have the write permissions on this dev branch.

Details about the tests:

Tests expected to fail for version 4.12.0 are marked as @pytest.mark.error.

For the updated minif2f dataset, you can review the corrected results here:

Test Dataset
Validation Dataset

Or load and inspect the results on your local machine using the code snippet provided below:

import pandas as pd

# Load the MiniF2F test and validation datasets
mini12_test = pd.read_json('experiments/minif2f/test.jsonl', lines=True)
mini12_valid = pd.read_json('experiments/minif2f/valid.jsonl', lines=True)

# Default Lean header shared by both generated files
default_header = """import Mathlib
open BigOperators Real Nat Topology"""

# Write the test dataset to a Lean file, closing each statement with `sorry`
# (Lean source files are UTF-8, so the encoding is set explicitly)
with open('formal_test.lean', 'w', encoding='utf-8') as f:
    f.write(default_header + '\n\n')
    text = '\n\n'.join(mini12_test.formal_statement.apply(lambda thm: f"{thm} := by sorry"))
    f.write(text)

# Write the validation dataset to a Lean file in the same way
with open('formal_valid.lean', 'w', encoding='utf-8') as f:
    f.write(default_header + '\n\n')
    text = '\n\n'.join(mini12_valid.formal_statement.apply(lambda thm: f"{thm} := by sorry"))
    f.write(text)
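
If pandas is not available (for example in a slim CI image), the same files can be produced with the standard library alone. This is a sketch under the assumption that every JSONL record has a `formal_statement` field, as the snippet above implies; `build_lean_source` and `jsonl_to_lean` are hypothetical helper names.

```python
import json
from pathlib import Path

DEFAULT_HEADER = "import Mathlib\nopen BigOperators Real Nat Topology"


def build_lean_source(statements, header=DEFAULT_HEADER):
    """Join theorem statements into one Lean file body, each closed with `sorry`."""
    theorems = [f"{stmt} := by sorry" for stmt in statements]
    return header + "\n\n" + "\n\n".join(theorems)


def jsonl_to_lean(jsonl_path, lean_path):
    """Read `formal_statement` from each JSONL record and write a Lean file.

    Returns the number of statements written.
    """
    with open(jsonl_path, encoding="utf-8") as f:
        # Skip blank lines so a trailing newline does not break json.loads
        statements = [json.loads(line)["formal_statement"] for line in f if line.strip()]
    Path(lean_path).write_text(build_lean_source(statements), encoding="utf-8")
    return len(statements)
```

Calling `jsonl_to_lean('experiments/minif2f/test.jsonl', 'formal_test.lean')` would then write the same content as the pandas version for the test split.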

lenianiva · 2025-01-12T17:06:57Z

Why did you delete poetry.lock?

RexWzh · 2025-01-12T17:17:19Z

> Why did you delete poetry.lock?

I removed poetry.lock since it caused test failures. I think the pyproject.toml is sufficient for version tracking. I can add back my local lock file if you prefer.

https://github.com/Lean-zh/PyPantograph/actions/runs/12732921282/job/35488677656

lenianiva · 2025-01-13T15:49:04Z

> > Why did you delete poetry.lock?
>
> I removed poetry.lock since it caused test failures. I think the pyproject.toml is sufficient for version tracking. I can add back my local lock file if you prefer.
>
> https://github.com/Lean-zh/PyPantograph/actions/runs/12732921282/job/35488677656

The lock file should not be removed. If having the lock file causes your unit tests to fail, something's seriously messed up with your setup.

RexWzh · 2025-01-13T16:20:51Z

I tried not to add to your review work, so I committed the lock file from your branch instead. It was a bit of a clumsy attempt, though. What I meant is that this file is machine-generated and doesn’t need to be tracked in version control. It’s better to let each environment generate its own lock file as needed.

RexWzh · 2025-01-14T01:37:41Z

This is a non-essential feature, and it feels like we could be losing some development efficiency by getting caught up in these details. As mentioned before, I’m excited to see people willing to contribute to open-source projects. I’ve learned a lot from the Open Source Promotion Plan, and I hope to contribute to Lean's projects through lean-zh. Perhaps we could focus on something more impactful for this project to start with.

lenianiva · 2025-01-14T06:34:51Z

I think this is a valuable contribution, but we need to refactor out all the clutter, e.g. the experiments.

RexWzh marked this pull request as ready for review December 20, 2024 20:16

RexWzh force-pushed the dev branch from 6d3c88d to 9968979 Compare December 20, 2024 21:24

RexWzh added 8 commits December 23, 2024 15:28

add pytest dependency

ea2b885

add test code for minif2f

4fbb6ad

update minif2f from purewhite

22b215a

add github action

02799cf

migrate tests from server.py

1dc7a96

add three error tests

366f9ec

fix minif2f statements

e934e87

add pytest marks

70e3a72

minif2f: fix & check

52aae67

RexWzh force-pushed the dev branch from ee7fe13 to 52aae67 Compare January 12, 2025 06:57

RexWzh added 3 commits January 12, 2025 15:20

Merge remote-tracking branch 'origin/main' into dev

4788a78

update tests

268ee54

ignore poetry.lock

affc15a

fix json dumps in server.py

ee9e70b

add pylock

c481532

RexWzh marked this pull request as draft January 13, 2025 16:24

RexWzh closed this Jan 14, 2025
