478 update main #115

Merged
merged 52 commits into from
Nov 6, 2024

lhubbardONS
Collaborator

Summary

Add your summary here - keep it brief, to the point, and in plain English.

Checklists

This pull request meets the following requirements:

  • installable with all dependencies recorded
  • runs without error
  • follows PEP8 and project-specific conventions
  • appropriate use of comments, for example no descriptive comments
  • functions documented using NumPy-style docstrings
  • assumptions and decisions log considered and updated if appropriate
  • unit tests have been updated to cover essential functionality for a reasonable range of inputs and conditions
  • other forms of testing such as end-to-end and user-interface testing have been considered and updated as required
  • test suite passes (locally as a minimum)
  • peer reviewed with review recorded

If you feel some of these conditions do not apply for this pull request, please
add a comment to explain why.

@Jday7879 Jday7879 requested a review from AntonZogk November 5, 2024 10:04
@AntonZogk AntonZogk marked this pull request as ready for review November 5, 2024 10:59
from rdsa_utils.cdp.helpers.s3_utils import load_json


def dfs_from_spp(

@AntonZogk AntonZogk Nov 5, 2024

The function name should have a verb: read_dfs_from_spp, perhaps, or something similar.

return mapper


def read_and_combine_colon_sep_files(folder_path, column_names, config):

missing docs
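
As an illustration of the kind of NumPy-style docstring being asked for, here is a minimal sketch. It is not the implementation in this PR: `combine_colon_sep_files_sketch` is a hypothetical, dependency-free stand-in that returns a list of dictionaries, and it omits the `config` argument because the diff does not show how `config` is used.

```python
import csv
from pathlib import Path


def combine_colon_sep_files_sketch(folder_path, column_names):
    """
    Read every colon-separated file in a folder and combine the rows.

    Hypothetical stand-in, written only to show the docstring layout.

    Parameters
    ----------
    folder_path : str or pathlib.Path
        Folder containing the colon-separated files.
    column_names : list of str
        Names to assign to the columns, in file order.

    Returns
    -------
    list of dict
        One dictionary per row, keyed by ``column_names``.
    """
    rows = []
    # Sort so the combined order does not depend on the filesystem.
    for path in sorted(Path(folder_path).iterdir()):
        if not path.is_file():
            continue
        with path.open(newline="") as handle:
            for values in csv.reader(handle, delimiter=":"):
                rows.append(dict(zip(column_names, values)))
    return rows
```

The point is the layout: a one-line summary, then `Parameters` and `Returns` sections with a type and a description for each entry.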


Returns
-------
_type_

missing type

output_path = config["output_path"]
file_version_mbs = metadata.metadata("monthly-business-survey-results")["version"]
snapshot_name = config["mbs_file_name"].split(".")[0]
imputation_filename = f"imputation_output_v{file_version_mbs}_{snapshot_name}.csv"

Loved that you have the snapshot name.
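
For readers following along, the naming logic in the snippet above can be sketched in isolation. `build_imputation_filename` is a hypothetical helper, and the version string and file name in the usage line are made-up values; the real code reads the version from package metadata and the file name from `config["mbs_file_name"]`.

```python
def build_imputation_filename(file_version, mbs_file_name):
    """Build the imputation output filename from a version and snapshot file name."""
    # Keep only the part before the first dot, as in the diff above.
    snapshot_name = mbs_file_name.split(".")[0]
    return f"imputation_output_v{file_version}_{snapshot_name}.csv"


# Hypothetical inputs, for illustration only.
print(build_imputation_filename("0.1.2", "snapshot_example.json"))
# prints "imputation_output_v0.1.2_snapshot_example.csv"
```

Note that `split(".")[0]` keeps only the text before the first dot, so a snapshot file name that itself contains dots would be truncated.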

warnings.warn("A placeholder function for validating dataframe post staging")


def validate_imputation(df: pd.DataFrame, config: dict):

Missing docs in all validation functions.

Parameters
----------
df : pd.DataFrame
_description_

missing description and types

@AntonZogk AntonZogk left a comment

Amazing work, just some very minor changes in the comments. It might also be worth adding a test for the run main function; this would help ensure that everything stays integrated if the pipeline is extended or updated. We test the methods thoroughly, so just a basic integration test should be fine?
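
A basic integration test of this kind might look like the sketch below. Everything in it is an assumption for illustration: `stage`, `impute` and `run_pipeline` are hypothetical stand-ins, not the functions in this repository, and the test only checks that the stages run end to end on a tiny input.

```python
def stage(config):
    """Hypothetical staging step: return rows of raw survey data."""
    return [dict(row) for row in config["rows"]]


def impute(rows):
    """Hypothetical imputation step: fill missing values with the mean."""
    known = [row["value"] for row in rows if row["value"] is not None]
    fill = sum(known) / len(known)
    return [
        {**row, "value": fill if row["value"] is None else row["value"]}
        for row in rows
    ]


def run_pipeline(config):
    """Hypothetical stand-in for the pipeline's main entry point."""
    return impute(stage(config))


def test_run_pipeline_smoke():
    """Basic integration check: the stages run end to end on a tiny input."""
    config = {
        "rows": [
            {"reference": 1, "value": 10.0},
            {"reference": 2, "value": None},
        ]
    }
    result = run_pipeline(config)
    assert len(result) == 2
    assert all(row["value"] is not None for row in result)
```

In the real project the test would call the actual entry point with a small test config, so that a change to any stage that breaks the hand-off between stages fails the test.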

@AntonZogk AntonZogk left a comment

All looking good :)

@Jday7879 Jday7879 merged commit 3bd3dd4 into main Nov 6, 2024
5 checks passed
@Jday7879 Jday7879 deleted the 478-update-main branch November 6, 2024 10:48

3 participants