-
Notifications
You must be signed in to change notification settings - Fork 2.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add output-dir-mirror option #2219
Conversation
programs/util.h
Outdated
int UTIL_chmod(char const* filename, mode_t permissions); /*< like chmod, but avoid changing permission of /dev/null */ | ||
int UTIL_compareStr(const void *p1, const void *p2); | ||
const char* UTIL_getFileExtension(const char* infilename); | ||
int UTIL_isFileNameValidForMirroredOutput(const char *filename); | ||
void UTIL_convertPathnameToDirName(char *pathname); | ||
char* UTIL_trimPath(char *filename); |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Are there cases where the return pointer of this function must be able to mutate or free()
its content ?
I did not notice one. If not, it would be better to return a const char*
.
As a consequence, the argument could be const char*
too, since it's not mutated either.
It also makes it a bit clearer that this function returns a reference, and not a new object.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
UTIL_trimPath will call trimLeadingCurrentDir(), trimLeadingCurrentDir() has user case of returning mutable object.
It remind me the 'strchr()': (https://opensource.apple.com/source/BerkeleyDB/BerkeleyDB-18/db/clib/strchr.c.auto.html)
char *strchr(const char *p, int ch)
{
char c;
c = ch;
for (;; ++p) {
if (*p == c)
return ((char *)p);
if (*p == '\0')
return (NULL);
}
/* NOTREACHED */
}
So seems like even C library is doing the const cast. There is an interesting analysis on this: https://stackoverflow.com/questions/14367727/how-does-strchr-implementation-work. So I can either write 1 function doing const-cast, or write a duplicated functions.
trimLeadingCurrentDir() is 1 liner function, so I probably going to duplicate it into 2 funcitons that return const and non-const type.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Indeed, strchr()
is one of these cases where the standard got it seriously wrong, sacrificing the future for the short-term benefit of existing code. That being said, I'm a bit more sympathetic for errors made during this era, when many concepts where still being worked on. After all, const
was introduced after strchr()
, and this lead to a situation where the function was already deployed, and the committee was under strong pressure to limit code breakage, so adaptation to const
was a tricky situation.
Anyway, nowadays, we have no such excuse. const
is an extremely important property, and it shall be applied and respected scrupulously. If there is a need for 2 versions, one safe which only works on references and provides references, and a more powerful but also more dangerous one which allows full ownership transfer, then it's better to write 2 functions.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I ended up doing a trick on this:
static const char*
trimLeadingCurrentDirConst(const char *filename)
{
assert(filename != NULL); // don't expect null string here
if ((filename[0] == '.') && (filename[1] == PATH_SEP))
return filename + 2;
return filename;
}
static char*
trimLeadingCurrentDir(char *filename)
{
/* 'union charunion' can do const-cast without compiler warning */
union charunion {
char *chr;
const char* cchr;
} ptr;
ptr.cchr = trimLeadingCurrentDirConst(filename);
return ptr.chr;
}
seems to me a good compromise on this. Let me know what you think.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Interesting. That's a use of union
I'm not used to. I'm wondering if it could be prone to strict aliasing issues. If not, then it's a clever use to circumvent const
property.
ade1504
to
05d20a7
Compare
programs/util.c
Outdated
{ | ||
int len = 0; | ||
char* pos = NULL; | ||
// get dir name from pathname similar to 'dirname' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
C90
comment style is limited to /* (...) */
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
done.
Thanks @xxie24 , We'll have a number of back and forth for the tactical details, |
86ca213
to
ab29986
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Some minor details about error case.
Overall, this looks good, well structured.
Also, consider documenting programs/zstd.1.md
.
It's a perfect place to present --output-dir-mirror
, and explain how it's supposed to work, and its limitations.
This markdown document will be later translated into a man
page (you don't have to do that part, it requires some dependencies).
ab29986
to
1c9426a
Compare
1c9426a
to
9a8ccd4
Compare
Thanks @xxie24 , this is a great feature, which is highly welcomed. I note that one last test is failing ( |
No description provided.