-
Notifications
You must be signed in to change notification settings - Fork 616
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Set base address on OSX #313
Conversation
This fixes issue google#311. The mapped address of libraries was never considered, nor was the load address of the segments.
Codecov Report
@@ Coverage Diff @@
## master #313 +/- ##
=========================================
- Coverage 65.86% 65.8% -0.06%
=========================================
Files 36 36
Lines 7402 7429 +27
=========================================
+ Hits 4875 4889 +14
- Misses 2132 2142 +10
- Partials 395 398 +3
Continue to review full report at Codecov.
|
I can accept the fix as is because it's an improvement we need, but how about a test? |
Sure. What should it test exactly though? I could throw a small mach-o library and binary in there to check |
Yes, a test similar to the ones in https://github.com/google/pprof/blob/master/internal/binutils/binutils_test.go against a canned binary from testdata. The test will need to be skipped on anything non-Mac. |
There is already a function very similar to getTextAddress(). This also adds a test that tries reading symbols / source addresses from a library and an executable. I also had to fix a couple of bugs for the tests to work: 1. Crash when the regex passed to f.Symbols() was nil. 2. The last symbol returned by `nm` is always discarded.
I added a test. The test actually found some bugs elsewhere in the code. Also I'm not 100% sure about the intent of some of the test I copied it from. Anyway the test passes and I guess it was useful as it contains bug fixes. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do files internal/binutils/debug.test, internal/binutils/testdata/exe_mac_64.dSYM/Contents/Info.plist, internal/binutils/testdata/exe_mac_64.dSYM/Contents/Resources/DWARF/exe_mac_64, internal/binutils/testdata/lib_mac_64.dSYM/Contents/Info.plist, internal/binutils/testdata/lib_mac_64.dSYM/Contents/Resources/DWARF/lib_mac_64 need to be part of the PR?
internal/binutils/binutils.go
Outdated
@@ -179,10 +179,20 @@ func (b *binrep) openMachO(name string, start, limit, offset uint64) (plugin.Obj | |||
} | |||
defer of.Close() | |||
|
|||
base := start |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I am a bit hesitant whether a better value for the base in the absence of __TEXT is zero or start
. But I guess it's not important since files without __TEXT should be impractical case.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah I'll add a warning if it isn't found.
internal/binutils/binutils_test.go
Outdated
t.Fatalf("Symbols: unexpected error %v", err) | ||
} | ||
|
||
find := func(name string) *plugin.Sym { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit: could move this piece out to an outer function to reuse with the other test.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Will do.
@@ -62,7 +69,7 @@ func matchSymbol(names []string, start, end uint64, r *regexp.Regexp, address ui | |||
return names | |||
} | |||
for _, name := range names { | |||
if r.MatchString(name) { | |||
if r == nil || r.MatchString(name) { |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Do we pass r as nil from anywhere? Ah, you do know in f.Symbols(nil, 0)
in the test. I think you can just do f.Symbols(regexp.MustCompile(tc.sym), 0)
?.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
No but the comments for the Symbols()
interface say it can be nil.
internal/binutils/disasm.go
Outdated
@@ -37,6 +37,7 @@ func findSymbols(syms []byte, file string, r *regexp.Regexp, address uint64) ([] | |||
var symbols []*plugin.Sym | |||
names, start := []string{}, uint64(0) | |||
buf := bytes.NewBuffer(syms) | |||
symAddr := uint64(0) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I think this symAddr will get shadowed by the assignment in the loop header, so it's probably not what the code after the loop expects.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Well spotted!
@Timmmm Huh, Mac tests fail on Travis with
|
Hoisting out this question from the last round of comments because those top-level Github comments are easy to miss so not sure you saw it: Do files internal/binutils/debug.test, internal/binutils/testdata/exe_mac_64.dSYM/Contents/Info.plist, internal/binutils/testdata/exe_mac_64.dSYM/Contents/Resources/DWARF/exe_mac_64, internal/binutils/testdata/lib_mac_64.dSYM/Contents/Info.plist, internal/binutils/testdata/lib_mac_64.dSYM/Contents/Resources/DWARF/lib_mac_64 need to be part of the PR? |
Ah yeah I missed that, and yes they do, except |
Yeah that seems to be it. Removing However |
@Timmmm A lot of development here happens on Linux. I carry a Mac laptop but a lot of times I'm just in an SSH window to my Linux machine. And when I do use it for actual coding, that's Go and I'm not hitting the issues you've found because the Golang profiles are pre-symbolized by the runtime. Thanks for figuring many things out here! |
* Return an error if __TEXT isn't found * Return an error if start - textSegment.addr would underflow * Break out findSymbol() function * Rework findSymbols() function, it now returns non-EOF errors instead of ignoring them * Remove accidentally added debug binary
I've updated it as requested. The test will still fail because
I'd suggest the easiest thing for now is to install |
Oh yeah I also forgot to ask - is the logic behind |
internal/binutils/binutils.go
Outdated
|
||
textSegment := of.Segment("__TEXT") | ||
if textSegment == nil { | ||
return nil, fmt.Errorf("Parsing %s: no __TEXT segment", name) |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Error messages should start with lowercase: https://github.com/golang/go/wiki/CodeReviewComments#error-strings
Suggesting fmt.Errorf("not a mach executable: no __TEXT segment: %s", name)
, to be somewhat consistent with fmt.Errorf("unrecognized binary: %s", name)
above in this file.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
I just copied the one above but ok.
internal/binutils/binutils.go
Outdated
return nil, fmt.Errorf("Parsing %s: no __TEXT segment", name) | ||
} | ||
if textSegment.Addr > start { | ||
return nil, fmt.Errorf("Parsing %s: __TEXT segment address (0x%x) > mapping start address (0x%x)", |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Error messages should start with lowercase: https://github.com/golang/go/wiki/CodeReviewComments#error-strings
Suggesting fmt.Errorf("not a mach executable: __TEXT segment address (0x%x) > mapping start address (0x%x): %s", testSegment.Addr, start, name)
.
By the way, could base
be negative in case of some exotic executable ASLR? Probably not, I think in reality for PIE executables textSegment address is zero, at least that's what I saw on Linux.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yeah I wasn't sure about that but even if it is you'd probably want to know about it - i.e. get a bug report here, rather than just having it underflow and fail silently (or get lucky with the unsigned arithmetic).
internal/binutils/disasm.go
Outdated
for { | ||
symAddr, name, err := nextSymbol(buf) | ||
if err == io.EOF { | ||
// Done! If there was an unfinished group, append it. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Nit / opiniated: remove the exclamation mark, those are overly emotional for comments and log messages to my taste...
@@ -0,0 +1,20 @@ | |||
<?xml version="1.0" encoding="UTF-8"?> |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Reading https://stackoverflow.com/questions/8058497/can-clang-build-executables-with-dsym-debug-information, it appears to me that building the test binaries from the command line it should be possible to have the debug information as part of those, and these additional dSYM files and directory would not be needed. I can look into doing that myself, can you include the source code of the executable and library test binaries that you used?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Yep you can build it from source too. I just copied how the Linux test worked.
The only weird thing is if you have linking as a separate step (i.e. clang++ -c foo.cpp -o foo.o; clang++ foo.o -o foo
rather than just doing it all in one step you need to run dysmutil
on the binary. Shouldn't matter in this case because it is just one file.
@timm I left some comments, those are minor, please address. I apologize it took so much longer than I thought it would. I'd probably suggest that you leave the test code in as is, but make the mac test skipped permanently, and I'll look into enabling that. For |
Ok I fixed some of the nits (getting a little extreme!) and lowercased other error messages. I didn't use your exact error message suggestions because they still are mach files - and there was already an error message for failing to find the base address for ELF so I copied that. The source I used was:
Compiled with |
@Timmmm LGTM, merging. Thanks for bearing with those nit comments. And thanks for this fix overall. I'll get the test fixed. |
Set base address on OSX. This fixes issue google#311. The mapped address of libraries was never considered, nor was the load address of the segments. It also fixes a couple of issues in binutils.go, such as when the tail of the data wasn't handled right when grouping symbols at the same address.
Set base address on OSX. This fixes issue google#311. The mapped address of libraries was never considered, nor was the load address of the segments. It also fixes a couple of issues in binutils.go, such as when the tail of the data wasn't handled right when grouping symbols at the same address.
This fixes issue #311. The mapped address of libraries was never considered, nor was the load address of the segments.