Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add output-dir-mirror option #2219

Merged
merged 1 commit into from
Jul 10, 2020
Merged

Conversation

xxie24
Copy link
Contributor

@xxie24 xxie24 commented Jun 23, 2020

No description provided.

programs/fileio.c Outdated Show resolved Hide resolved
programs/util.h Outdated Show resolved Hide resolved
programs/util.c Outdated Show resolved Hide resolved
programs/util.h Outdated
int UTIL_chmod(char const* filename, mode_t permissions); /*< like chmod, but avoid changing permission of /dev/null */
int UTIL_compareStr(const void *p1, const void *p2);
const char* UTIL_getFileExtension(const char* infilename);
int UTIL_isFileNameValidForMirroredOutput(const char *filename);
void UTIL_convertPathnameToDirName(char *pathname);
char* UTIL_trimPath(char *filename);
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Are there cases where the return pointer of this function must be able to mutate or free() its content ?
I did not notice one. If not, it would be better to return a const char*.
As a consequence, the argument could be const char* too, since it's not mutated either.
It also makes it a bit clearer that this function returns a reference, and not a new object.

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

UTIL_trimPath will call trimLeadingCurrentDir(), trimLeadingCurrentDir() has user case of returning mutable object.

It remind me the 'strchr()': (https://opensource.apple.com/source/BerkeleyDB/BerkeleyDB-18/db/clib/strchr.c.auto.html)

char *strchr(const char *p, int ch)
{
	char c;

	c = ch;
	for (;; ++p) {
		if (*p == c)
			return ((char *)p);
		if (*p == '\0')
			return (NULL);
	}
	/* NOTREACHED */
}

So seems like even C library is doing the const cast. There is an interesting analysis on this: https://stackoverflow.com/questions/14367727/how-does-strchr-implementation-work. So I can either write 1 function doing const-cast, or write a duplicated functions.

trimLeadingCurrentDir() is 1 liner function, so I probably going to duplicate it into 2 funcitons that return const and non-const type.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Indeed, strchr() is one of these cases where the standard got it seriously wrong, sacrificing the future for the short-term benefit of existing code. That being said, I'm a bit more sympathetic for errors made during this era, when many concepts where still being worked on. After all, const was introduced after strchr(), and this lead to a situation where the function was already deployed, and the committee was under strong pressure to limit code breakage, so adaptation to const was a tricky situation.

Anyway, nowadays, we have no such excuse. const is an extremely important property, and it shall be applied and respected scrupulously. If there is a need for 2 versions, one safe which only works on references and provides references, and a more powerful but also more dangerous one which allows full ownership transfer, then it's better to write 2 functions.

Copy link
Contributor Author

@xxie24 xxie24 Jun 24, 2020

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I ended up doing a trick on this:

static const char*
trimLeadingCurrentDirConst(const char *filename)
{
    assert(filename != NULL); // don't expect null string here
    if ((filename[0] == '.') && (filename[1] == PATH_SEP))
        return filename + 2;
    return filename;
}

static char*
trimLeadingCurrentDir(char *filename)
{
    /* 'union charunion' can do const-cast without compiler warning */
    union charunion {
        char *chr;
        const char* cchr;
    } ptr;
    ptr.cchr = trimLeadingCurrentDirConst(filename);
    return ptr.chr;
}

seems to me a good compromise on this. Let me know what you think.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Interesting. That's a use of union I'm not used to. I'm wondering if it could be prone to strict aliasing issues. If not, then it's a clever use to circumvent const property.

@xxie24 xxie24 force-pushed the output-dir-mirror branch 2 times, most recently from ade1504 to 05d20a7 Compare June 23, 2020 20:49
programs/fileio.c Outdated Show resolved Hide resolved
programs/fileio.c Outdated Show resolved Hide resolved
programs/util.c Outdated
{
int len = 0;
char* pos = NULL;
// get dir name from pathname similar to 'dirname'
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

C90 comment style is limited to /* (...) */

Copy link
Contributor Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

done.

programs/zstdcli.c Outdated Show resolved Hide resolved
@Cyan4973
Copy link
Contributor

Thanks @xxie24 ,
this is a rather good beginning.
You managed to integrate this capability in a way which makes sense within this project.
Also, the way you divide the topic into unitary components is appropriate.

We'll have a number of back and forth for the tactical details,
mostly around code conventions and specific precautions to C programming,
but I believe you've got the overall strategy right.

@xxie24 xxie24 force-pushed the output-dir-mirror branch 10 times, most recently from 86ca213 to ab29986 Compare June 24, 2020 23:00
@xxie24 xxie24 marked this pull request as draft June 24, 2020 23:02
@xxie24 xxie24 marked this pull request as ready for review June 25, 2020 00:03
Copy link
Contributor

@Cyan4973 Cyan4973 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Some minor details about error case.
Overall, this looks good, well structured.

Also, consider documenting programs/zstd.1.md.
It's a perfect place to present --output-dir-mirror, and explain how it's supposed to work, and its limitations.

This markdown document will be later translated into a man page (you don't have to do that part, it requires some dependencies).

programs/fileio.c Show resolved Hide resolved
programs/fileio.c Show resolved Hide resolved
@xxie24 xxie24 force-pushed the output-dir-mirror branch from ab29986 to 1c9426a Compare June 25, 2020 02:15
@xxie24 xxie24 requested a review from Cyan4973 June 25, 2020 02:15
@xxie24 xxie24 force-pushed the output-dir-mirror branch from 1c9426a to 9a8ccd4 Compare June 25, 2020 05:13
@Cyan4973
Copy link
Contributor

Thanks @xxie24 , this is a great feature, which is highly welcomed.

I note that one last test is failing (min-decomp-macros),
though I somewhat remember that it might be one of the new tests that tend to be a bit flackey. cc @bimbashrestha

@Cyan4973 Cyan4973 merged commit a3296da into facebook:dev Jul 10, 2020
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants