Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(metadata): Add profiles information to cargo metadata #12449

Closed
wants to merge 5 commits into from

Conversation

calavera
Copy link
Contributor

@calavera calavera commented Aug 5, 2023

Fixes #11228

I do understand that the team is not accepting PRs for issues that have not been accepted. However, I believe these changes are scoped well enough to be considered for review. Nevertheless, feel free to close this PR without reviewing, I understand that I didn't follow the contribution rules. In honesty, I wrote the code before I saw that warning, and I didn't want to keep the code for myself 😅

What does this PR try to resolve?

This change adds profile information to the output of cargo metadata. The output only changes if the user has included any specific profile settings in their Cargo.toml file, it doesn't expose default profiles that have not been modified. I opted for serializing this information only when there is information present because I believe it's a minimal incremental improvement that's easier to test, review and accept than if I tried to expose profile information that is not explicitly in Cargo.toml files.

How should we test and review this PR?

I've added integration tests in the metadata test suite for these changes.

@rustbot
Copy link
Collaborator

rustbot commented Aug 5, 2023

Thanks for the pull request, and welcome! The Rust team is excited to review your changes, and you should hear from @ehuss (or someone else) soon.

Please see the contribution instructions for more information. Namely, in order to ensure the minimum review times lag, PR authors and assigned reviewers should ensure that the review label (S-waiting-on-review and S-waiting-on-author) stays updated, invoking these commands when appropriate:

  • @rustbot author: the review is finished, PR author should check the comments and take action accordingly
  • @rustbot review: the author is ready for a review, this PR will be queued again in the reviewer's queue

@rustbot rustbot added A-manifest Area: Cargo.toml issues Command-metadata S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. labels Aug 5, 2023
@calavera calavera force-pushed the profile_info_in_metadata branch from 08b44bf to 17fc760 Compare August 5, 2023 16:45
@calavera
Copy link
Contributor Author

calavera commented Aug 5, 2023

Tests are failing because of something that looks like unrelated to these changes. I ran those same tests locally, and they pass:

 error: the feature `lang_items` is internal to the compiler or standard library
 --> src/lib.rs:4:12
  |
4 | #![feature(lang_items)]
  |            ^^^^^^^^^^
  |

@ehuss
Copy link
Contributor

ehuss commented Aug 5, 2023

Sorry about the hiccup, I have posted #12450 to fix the CI issue.

@calavera calavera force-pushed the profile_info_in_metadata branch from 17fc760 to 18c4f98 Compare August 6, 2023 00:00
@calavera
Copy link
Contributor Author

calavera commented Aug 6, 2023

Thanks for the quick fix @ehuss. I've rebased from master and tests pass now as expected.

@bors
Copy link
Contributor

bors commented Aug 25, 2023

☔ The latest upstream changes (presumably #12553) made this pull request unmergeable. Please resolve the merge conflicts.

Comment on lines 44 to 47
let workspace_profiles = match ws.root_maybe() {
MaybePackage::Virtual(vm) => vm.profiles().cloned(),
_ => None, // regular packages include the profile information under their package section
};
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I'm a little concerned about this approach. For example, if you have a non-virtual workspace, this could make it quite difficult to determine what the profiles are for the workspace, since you would need to somehow fish it out of the packages array. I would suggest just including ws.profiles() as-is.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

sounds good, I can make that update.

@ehuss ehuss added the T-cargo Team: Cargo label Aug 30, 2023
@ehuss
Copy link
Contributor

ehuss commented Aug 30, 2023

@rfcbot fcp merge

@rust-lang/cargo Just checking if people would be ok with adding the TOML profiles from the manifest in cargo metadata. I believe this will also show up with cargo read-manifest.

@rfcbot
Copy link
Collaborator

rfcbot commented Aug 30, 2023

Team member @ehuss has proposed to merge this. The next step is review by the rest of the tagged team members:

Concerns:

Once a majority of reviewers approve (and at most 2 approvals are outstanding), this will enter its final comment period. If you spot a major issue that hasn't been raised at any point in this process, please speak up!

See this document for info about what commands tagged team members can give me.

@rfcbot rfcbot added proposed-final-comment-period An FCP proposal has started, but not yet signed off. disposition-merge FCP with intent to merge labels Aug 30, 2023
Comment on lines 108 to 109
#[serde(skip_serializing_if = "Option::is_none")]
profiles: Option<TomlProfiles>,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Cargo only looks at the profile settings in the Cargo.toml manifest at the root of the workspace. Profile settings defined in dependencies will be ignored.

imo we should only serialize profiles at the workspace level so people get something more akin to expected cargo behavior

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

that would simplify things and aligns with @ehuss comment above. I can remove it from there and just call it profiles in the output metadata.

Comment on lines 75 to 76
#[serde(skip_serializing_if = "Option::is_none")]
workspace_profiles: Option<TomlProfiles>,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The output only changes if the user has included any specific profile settings in their Cargo.toml file, it doesn't expose default profiles that have not been modified.

For me, the question is what the intended use case is we are covering. All callers will need to reinvent cargo's profile system to fully interpret this, including reading from config. Is that ok? Can we change this in the future if we find it doesn't work?

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In my case, Cargo's default profiles are not relevant. I want to know profile information that people set in their projects explicitly. I'll be happy to expose the default profile data if you think it'd be useful, but I don't know where to find it.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

One specific use case doesn't tell us much though. We are looking at adding a new feature and we need to consider other users and future users to make sure we are providing the right thing.

Do we understand enough to move forward either direction?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

In my usecase I would like to have the default profiles available. My usecase would be to provide CMake users an option to select one of the available cargo profiles. For this I would like to have
a) name of the profile
b) path of the profile (mainly because dev maps to debug)

This list should include the default profiles, although I'm not sure about test and bench though, since those profiles are weird and artifacts don't land at a predictable location due to the hash being added.

Comment on lines 75 to 76
#[serde(skip_serializing_if = "Option::is_none")]
workspace_profiles: Option<TomlProfiles>,
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems like this should be profiles instead of workspace_profiles

It looks like we only use the workspace prefix when qualifying things workspace root, workspace members) but otherwise we don't (e.g. metadata, resolve)

@epage
Copy link
Contributor

epage commented Aug 30, 2023

@rfcbot concern default-config

See #12449 (comment)

@calavera calavera force-pushed the profile_info_in_metadata branch 3 times, most recently from 3623e27 to 30af6de Compare September 1, 2023 04:00
@calavera
Copy link
Contributor Author

calavera commented Sep 1, 2023

@ehuss @epage thanks for the feedback. I've made some changes based on your comments.

@epage I looked into exposing the default profiles too. It looks like I could do that if I change the implementation to use Profiles::new, and then expose the original profiles with a new function like this one:

+
+    /// Returns the original profiles written by the user in the manifest and config.
+    pub fn original_profiles(&self) -> &BTreeMap<InternedString, TomlProfile> {
+        &self.original_profiles
+    }

I would like to check with you that that's the right direction because I will have to update all the metadata output tests, and I don't want you all to review something that's not what you expect. Let me know what you think.

@epage
Copy link
Contributor

epage commented Sep 1, 2023

I would like to check with you that that's the right direction

My question isn't so much about what needs to be changed but exploring the requirements for the change itself. We need to understand that before we know what the right change is.

@calavera
Copy link
Contributor Author

calavera commented Sep 1, 2023

@epage that makes sense. My usecase is described in this comment: #11228 (comment)

@rustbot rustbot added the A-profiles Area: profiles label Sep 25, 2023
@calavera calavera force-pushed the profile_info_in_metadata branch 2 times, most recently from 5960ae1 to fb3a571 Compare September 25, 2023 02:15
@calavera calavera force-pushed the profile_info_in_metadata branch from fb3a571 to d480856 Compare September 25, 2023 02:33
This change adds profile information to the output of `cargo metadata`. I opted for serializing this information only when there is information present to minimize changes in the output.

Signed-off-by: David Calavera <[email protected]>
Implement a new method Profiles::all that returns all profiles in the workspace, including the root and predefined profiles.

This new method follows Cargo resolution rules to resolve profile inheritance and ordering.

Signed-off-by: David Calavera <[email protected]>
Signed-off-by: David Calavera <[email protected]>
@calavera calavera force-pushed the profile_info_in_metadata branch from d480856 to ffb2401 Compare September 25, 2023 02:41
Signed-off-by: David Calavera <[email protected]>
@calavera
Copy link
Contributor Author

calavera commented Sep 25, 2023

@epage, I've been looking at the use cases in this comment, and #11228. I think you're right, and exposing only information defined by the user in the Cargo.toml file is not enough. I did some digging today, and I created a new function that extracts all profiles in a workspace, including root and predefined profiles. The metadata command serializes that list instead of toml definitions. It's a bigger change than I wanted to make, but it's mostly decoupling functions from the Profiles struct.

Let me know what you think.

@@ -1081,7 +1081,84 @@ fn cmd_metadata_with_embedded() {
"target_directory": "[ROOT]/home/.cargo/target/[..]",
"version": 1,
"workspace_root": "[..]/foo",
"metadata": null
"metadata": null,
"profiles": {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should we reuse DEFAULT_PROFILES here?

fn root_profiles(
profiles: &BTreeMap<InternedString, TomlProfile>,
) -> Vec<(InternedString, ProfileMaker)> {
vec![
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We dont need a Vec here I guess? an [(InternedString, ProfileMaker); 2] should be sufficient.

/// Returns the built-in profiles (not including dev/release, which are
/// "root" profiles).
fn predefined_profiles() -> Vec<(&'static str, TomlProfile)> {
vec![
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

ditto. [(&'static str, TomlProfile); 3] is good to go.

@@ -6,6 +6,84 @@ use cargo_test_support::registry::Package;
use cargo_test_support::{basic_bin_manifest, basic_lib_manifest, main_file, project, rustc_host};
use serde_json::json;

pub(crate) const DEFAULT_PROFILES: &str = r#"{
"bench": {
"codegen_backend": null,
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Should codegen_backend be skipped, as the feature is not stable yet?

"debug_assertions": false,
"debuginfo": 0,
"incremental": false,
"lto": "false",
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

it seems odd that lto false is a stringified "false".

Comment on lines +933 to +936
match self {
Strip::None => "none".serialize(s),
Strip::Named(n) => n.serialize(s),
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Suggested change
match self {
Strip::None => "none".serialize(s),
Strip::Named(n) => n.serialize(s),
}
s.collect_str(self)

BTW it would be great if we can make commit atomic. Like, this change can fit it one commit of its own.

Comment on lines +84 to +90

for (name, profile) in &definitions {
let name = *name;
if let Some(maker) = create_profile_maker(name, profile, &makers, &definitions)? {
makers.insert(name, maker);
}
}
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Duplicate?

Suggested change
for (name, profile) in &definitions {
let name = *name;
if let Some(maker) = create_profile_maker(name, profile, &makers, &definitions)? {
makers.insert(name, maker);
}
}

Ok(profile_makers)
}

/// Returns the hard-coded directory names for built-in profiles.
fn predefined_dir_names() -> HashMap<InternedString, InternedString> {
Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This seems like a refactor in order to reuse code between Profiles::new and Proflies::all. I would suggest commit the refactor in a commit, and the other for adding the new feature. It helps both review and tracing Git history.

@weihanglo
Copy link
Member

@rfcbot concern "JSON message compatibility"

This PR insta-stabilizes all profiles settings in JSON format. However, we do have some issues around JSON message compatbility, especially when we want to add more types to a field later. The issue is tracked in #12377.

Do we want to resolve #12377 first then merge this PR? Should we look into each field and determine the serialization format one by one? Or both?

@calavera
Copy link
Contributor Author

This PR insta-stabilizes all profiles settings in JSON format. However, we do have some issues around JSON message compatbility, especially when we want to add more types to a field later. The issue is tracked in #12377.

I was not aware of that ticket, but I'll take a look. I basically didn't want to change anything in the format for backwards compatibility, but if there are concerns about the format itself, I can help to stabilize it before landing all these changes.

@calavera
Copy link
Contributor Author

calavera commented Oct 5, 2023

I'm going to close this PR for now. After digging through the code, I have concerns about how best expose profiles in the metadata. The lack of stable format doesn't help either. I'll try to send a different PR with some code improvements that make the profiles module more testeable, that should give people more confidence to look into this part of the codebase.

@calavera calavera closed this Oct 5, 2023
@rfcbot rfcbot removed proposed-final-comment-period An FCP proposal has started, but not yet signed off. disposition-merge FCP with intent to merge labels Oct 5, 2023
@weihanglo
Copy link
Member

Thank you @calavera for working hard on this from different angles!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A-manifest Area: Cargo.toml issues A-profiles Area: profiles Command-metadata S-waiting-on-review Status: Awaiting review from the assignee but also interested parties. T-cargo Team: Cargo
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Cargo metadata: Expose available build profiles
8 participants