Iso9660 #82

ghost · 2018-02-03T17:38:04Z

This is far from done, but it the review should not become too big

… cannot parse valid ISO + rockridge anymore

…d which doc it references

filesystem/iso9660.ksy

KOLANICH · 2018-02-04T14:10:17Z

filesystem/iso9660.ksy

+      - id: rec_date_time
+        doc-ref: ecma-119 9.1.5
+        type: recdatetime
+        if: len_dr > 0x0


I guess we need to move the comparison into an instance and use in if that instance.

KOLANICH · 2018-02-04T14:13:06Z

filesystem/iso9660.ksy

+      - id: publisher_identifier
+        doc-ref: ecma-119 8.5.14
+        type: str
+        size: 0x80


Consider moving these stings of predefined lengths into own types

KOLANICH · 2018-02-04T14:14:43Z

filesystem/iso9660.ksy

-  vol_desc_boot_record:
+        doc-ref: ecma-119 8.1.3
+        contents: [0x01]
+      - id: boot_record


Switch should be here too.

filesystem/iso9660.ksy

KOLANICH · 2018-02-04T15:43:14Z

filesystem/iso9660.ksy

+        type: b1
+      - id: creation_short
+        type: datetime_short
+        if: ( creation == true ) and ( long_form == false )


I see lots of long_form == true and long_form == false.
Consider moving long and short forms into separate types.

KOLANICH · 2018-02-04T16:28:27Z

filesystem/iso9660.ksy

+        if: long_form == false
+      - id: datetime_long
+        type: rrip_tf_long
+        if: long_form == true


I guess == true is not needed. x == false is not x.
Use switch-on and cases instead of sequential ifs and make sure that the types have compatible interface.

KOLANICH · 2021-04-04T22:09:06Z

This ISO9660 spec works, the file in kaitai-io/kaitai_struct_formats does not.

Thank you for the clarification, and especially for the spec.

filesystem/iso9660.ksy

KOLANICH · 2021-04-04T22:20:38Z

filesystem/iso9660.ksy

-        encoding: UTF-8
-      - id: bibliographic_file_id
+        size: 0x2
+      - id: minute


in another occurence it was min

My variables are the same as in the document, but I follow multiple documents.
So the naming style does sometimes change.

If I rename things the validation against the original spec would get much harder.

-orig-id:

Same discussion: #82 (comment)

KOLANICH · 2021-04-04T22:25:07Z

filesystem/iso9660.ksy

+          - id: volume_flags_not_iso2375
+            doc-ref: ecma-119 8.5.3 b0
+            type: b1
+          - id: system_identifier


duplication

I don't see it. One is system other is volume, even the doc-ref is different

https://github.com/kaitai-io/kaitai_struct_formats/pull/82/files#diff-1e076bd9a91d2da636ba57110890a3857b5ebade3b60b49ed57f9d1d35dc08d2R162

I have implemented these standards exactly!
The fact that ref. 8.4 & 8.5 partially contain the same items and not a reference to example a u4bi is not for me to fix.

primary: doc-ref: ecma-119 8.4

- id: system_identifier doc-ref: ecma-119 8.4.5 type: text32 - id: volume_identifier doc-ref: ecma-119 8.4.6 type: text32

supplementary: doc-ref: ecma-119 8.5

- id: system_identifier doc-ref: ecma-119 8.5.4 type: text32 - id: volume_identifier doc-ref: ecma-119 8.5.5 type: text32

IMHO the fact that it is written so in the original spec shouldn't prevent us from eliminating redundancy. So, it is for you to fix, their spec is their, but the ksy spec is yours!

Well I guess we conflict on 2 items... this and the naming of some identifiers as that topic came up 2 times.

This PR works and will greatly extent the support of the format.
If you don't want my code, then let me know and I will close the pr.

Don't close your PR, your PR is welcome here! The notices made are not requirements, but advices.

KOLANICH · 2021-04-04T22:28:13Z

filesystem/iso9660.ksy

+                            type: b1
+                          - id: content
+                            type: str
+                            size: length - 5


It may make sense to move the 5 bytes into a subtype and replace 5 with sizeof<the_subtype>. Thsre are probably other places it can be done.

Well in the spec they give the total size of the block incl the header, so you have to reduce the size of the header.
And making breaking it into a subtype will just make more of a mess.

And making breaking it into a subtype will just make more of a mess.

Yes, I have to agree - it probably will. However, I have a better idea - you use these specific types in header:

kaitai_struct_formats/filesystem/iso9660.ksy

Lines 513 to 535 in 619e1ba

- id: specific

type:

switch-on: signature

cases:

'signature::aaip_attribute_list': susp_unknown # AL

'signature::rras_amiga_specific': rras_as # AS

'signature::susp_continuation_area': susp_ce # CE

'signature::rrip_child_link': rrip_cl # CL

'signature::susp_extensions_reference': susp_er # ER

'signature::susp_extension_selector': susp_es # ES

'signature::rrip_alternate_name': rrip_nm # NM

'signature::susp_padding_field': susp_pd # PD

'signature::rrip_parent_link': rrip_pl # PL

'signature::rrip_posix_device_number': rrip_pn # PN

'signature::rrip_posix_file_attributes': rrip_px # PX

'signature::rrip_relocated_directory': rrip_re # RE

'signature::rrip_extensions_in_use_indicator': susp_unknown # RR

'signature::rrip_sparse_file': rrip_sf # SF

'signature::rrip_symbolic_link': susp_sl # SL

'signature::susp_indicator': susp_sp # SP

'signature::susp_terminator': susp_st # ST

'signature::rrip_time_file': rrip_tf # TF

'signature::rrzf_zisofs': rrzf_zf # ZF

and in fact, all of them start with length and version.

kaitai_struct_formats/filesystem/iso9660.ksy

Lines 958 to 964 in 619e1ba

susp_unknown: # default for now

seq:

- id: length

type: u1

- id: version

type: u1

- id: data

kaitai_struct_formats/filesystem/iso9660.ksy

Lines 558 to 566 in 619e1ba

rras_as:

doc-ref: rras

seq:

- id: length

type: u1

- id: version

type: u1

valid: 1

- id: reserved

kaitai_struct_formats/filesystem/iso9660.ksy

Lines 645 to 653 in 619e1ba

susp_ce:

doc-ref: susp 5.1

seq:

- id: length

type: u1

- id: version

type: u1

valid: 1

- id: ca_location

and so on. I checked that this pair appears in the beginning of all fields, with the only exception that susp_unknown doesn't have the valid: 1 constraint on version (if this is intentional and not an oversight, we can easily get around it by using if or valid/expr).

It would be much better if you would extract these common fields right into header (because in fact, why not, it makes sense in my opinion - the names length and version sound like they totally fit into a header structure):

kaitai_struct_formats/filesystem/iso9660.ksy

Lines 508 to 513 in 619e1ba

header:

seq:

- id: signature

type: u2be

enum: signature

- id: specific

header: seq: - id: signature type: u2be enum: signature + - id: length + type: u1 + - id: version + type: u1 + valid: 1 - id: specific

The nice thing about it is that now you have the length available in the parent struct, so you can simply put the field specific in a substream. Then you will be able to throw out all the size: length - 5 calculations and just use size-eos: true. Also, it won't allow the specific types to possibly get out of the designated substream and the parsing will fail on the first field that doesn't fit in there (not by ending up with a negative size from those size: length - 5 calculations, which will also lead to an error, but fields with these calculations are not even present in all types - not sure if it's explicitly written in the reference spec that the length should be disregarded in some specific types, but if it's not, I'd definitely confine every specific type within that substream).

header: seq: @@ ... @@ - id: specific + size: length - (signature._sizeof + length._sizeof + version._sizeof) # note: this `_sizeof` sum should probably be in a `value` instance type: switch-on: signature cases: 'signature::aaip_attribute_list': susp_unknown # AL @@ ... @@ 'signature::rrzf_zisofs': rrzf_zf # ZF

Now, you will just take each specific type, for example:

kaitai_struct_formats/filesystem/iso9660.ksy

Lines 722 to 731 in 619e1ba

susp_pd:

doc-ref: susp 5.2

seq:

- id: length

type: u1

- id: version

type: u1

valid: 1

- id: padding_area

size: length - 4

..., simply throw out length and version, and replace size: length - 4 with size-eos: true:

susp_pd: doc-ref: susp 5.2 seq: - - id: length - type: u1 - - id: version - type: u1 - valid: 1 - id: padding_area - size: length - 4 + size-eos: true

You see how much better it is? You'll replace 18 identical length, version pairs in every single specific type with 1 common pair in header, you'll treat the length consistently and correctly, and eliminate the magic numbers in manual size: length - 5 calculations. What more could we possibly wish for?

"What more could we possibly wish for?"
For the feeling to go away that this PR is eternally unmergeable.

You can have it as-is and improve on it in your own time.

filesystem/iso9660.ksy

armijnhemel · 2022-01-04T16:43:29Z

filesystem/iso9660.ksy

+                            type: b1
+                          - id: current
+                            type: b1
+                          - id: continue


I would recommend changing this to continued as continue is a reserved keyword in Python and it is causing some issues.

armijnhemel · 2022-01-04T16:44:02Z

filesystem/iso9660.ksy

+                            type: b1
+                          - id: current
+                            type: b1
+                          - id: continue


I would also recommend changing this to continued as continue is a reserved keyword in Python

maybe continuation?

Please see

I have read it and what you are suggesting is exactly what I am doing. I am using it (in another project) and verifying and adapting it. I am just leaving the comments here for future reference.

armijnhemel · 2022-01-05T14:54:40Z

filesystem/iso9660.ksy

+                  - id: header
+                    type: header
+                    if: _io.size - _io.pos >= 2
+                    repeat: eos


According to IEEE P1281 a valid SUSP entry is at least 4 bytes long:

If the remaining allocated space following the last recorded System Use Entry in a System Use field or Continuation Area is less than four bytes long, it cannot contain a System Use Entry and shall be ignored.

I have adapted susp to the following:

susp: doc: | We check if we have at least 4 bytes left. The SUSP magic is 2 bytes. A complete SUSP entry is at least 4 bytes long. seq: - id: header type: header if: _io.size - _io.pos >= 4 repeat: until repeat-until: _io.size - _io.pos < 4

armijnhemel · 2022-01-05T19:47:22Z

filesystem/iso9660.ksy

+            type: u1
+          - id: body
+            type: body
+            if: len_dr > 0x0


size: len_dr - len_dr._sizeof

armijnhemel · 2022-01-05T20:00:30Z

filesystem/iso9660.ksy

+                  The size of the system_use is defined as:
+                  ( len_dr - size of directory_record = 33 bytes ) - len_fi
+                type: susp
+                size: ( _parent.len_dr - 33 ) - len_fi


This is actually not correct: susp is whatever it is from up to that point, to the end of the directory record. If there is a padding byte, then it should be decreased by 1.

By specifying the size it becomes really easy to avoid this problem:

size-eos: true

armijnhemel · 2022-01-05T20:18:18Z

filesystem/iso9660.ksy

+          - id: directory_record
+            type: directory_record
+            repeat: until
+            repeat-until: _.len_dr == 0


This is actually not correct, as there can be additional padding bytes. Section 6.8.1.1 of the ISO9660 spec says:

Each Directory Record shall end in the Logical Sector in which it begins. Unused byte positions after the last Directory Record in a Logical Sector shall be set to (00).

So this means that you also need to look at the remaining bytes in the logical sector as well as the total length of the directory records.

SnowFlakey added 17 commits January 28, 2018 17:36

iso9660 from scratch, alpha version

17ce85b

More work, still very alpha

9b7a95d

Worked on the directory_record ( ref: 9.1 )

370b2b2

doc-ref work

7d82e92

More, doc-ref, some unused fields more strict, but I broke something,…

637e5d9

… cannot parse valid ISO + rockridge anymore

Fixed errors in directory_record, got rid of unproven items

5b9ad58

Changed order like the document, moved doc-ref as first element, adde…

bb55e5b

…d which doc it references

ascii to ASCII

29c7892

Fixed padding field logic, added first bits of SUSP

19ac579

Made recursive directories and files work.

1321741

padding_field fix, susp to su, added rrip values

72cc495

prep for first pull request, disabled dead code

550a2a5

Added SU Amiga Header

7616c0b

Zisofs SU header

d41793e

Actually show the content of files

c2be451

Broken code, SU header logic

e71d89c

Safe SUSP processing but every type is unknown for now

2dfad40

GreyCat reviewed Feb 4, 2018

View reviewed changes

filesystem/iso9660.ksy Outdated Show resolved Hide resolved

KOLANICH reviewed Feb 4, 2018

View reviewed changes

filesystem/iso9660.ksy Outdated Show resolved Hide resolved

KOLANICH reviewed Feb 4, 2018

View reviewed changes

filesystem/iso9660.ksy Show resolved Hide resolved

Renamed datetime functions, implemented rrip_nm, rrip_px and rrip_tf

76c5c1a

KOLANICH reviewed Feb 4, 2018

View reviewed changes

rrip_tf_short and rrip_tf_long as sub types

4a492cf

KOLANICH reviewed Feb 4, 2018

View reviewed changes

SnowFlakey added 3 commits February 4, 2018 17:52

Switched to switch

accc96e

Restored header for former glory

e2ce26d

Changed if true to if and if false to if not

028d853

KOLANICH reviewed Apr 4, 2021

View reviewed changes

filesystem/iso9660.ksy Outdated Show resolved Hide resolved

KOLANICH reviewed Apr 4, 2021

View reviewed changes

filesystem/iso9660.ksy Show resolved Hide resolved

KOLANICH reviewed Apr 4, 2021

View reviewed changes

filesystem/iso9660.ksy Show resolved Hide resolved

KOLANICH reviewed Apr 4, 2021

View reviewed changes

generalmimon reviewed Apr 4, 2021

View reviewed changes

filesystem/iso9660.ksy Show resolved Hide resolved

generalmimon reviewed Apr 4, 2021

View reviewed changes

filesystem/iso9660.ksy Outdated Show resolved Hide resolved

generalmimon removed the waiting for author label Apr 4, 2021

generalmimon reviewed Apr 4, 2021

View reviewed changes

filesystem/iso9660.ksy Outdated Show resolved Hide resolved

iso9660: multiple items of feedback

2d5bc21

generalmimon reviewed Apr 4, 2021

View reviewed changes

filesystem/iso9660.ksy Outdated Show resolved Hide resolved

iso9660: fix URL

438bba3

generalmimon reviewed Apr 4, 2021

View reviewed changes

filesystem/iso9660.ksy Outdated Show resolved Hide resolved

SnowFlakey added 6 commits April 5, 2021 08:02

iso9660: revert content, it breaks the code

d4d48a4

iso9660: removed all size in hex

eb12b4a

iso9660: padding in hex

3188df4

iso9660: content style changes

5e2809f

iso9660: fixing ecma-119 8.4.4 style too

80640d1

iso9660: check added if le and be values are equal

619e1ba

armijnhemel reviewed Jan 4, 2022

View reviewed changes

armijnhemel reviewed Jan 5, 2022

View reviewed changes

Merge remote-tracking branch 'upstream/master' into iso9660

5f8d5e1

armijnhemel reviewed Jan 5, 2022

View reviewed changes

generalmimon mentioned this pull request Jul 4, 2023

Dir Entires limit 50 #679

Open

ghost closed this by deleting the head repository Nov 23, 2024

This pull request was closed.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Iso9660 #82

Iso9660 #82

ghost commented Feb 3, 2018

KOLANICH Feb 4, 2018 •

edited

Loading

KOLANICH Feb 4, 2018

ghost Feb 5, 2018

KOLANICH Feb 4, 2018

ghost Feb 5, 2018

KOLANICH Feb 4, 2018 •

edited

Loading

ghost Feb 4, 2018

KOLANICH Feb 4, 2018 •

edited

Loading

ghost Feb 5, 2018

KOLANICH commented Apr 4, 2021

KOLANICH Apr 4, 2021 •

edited

Loading

ghost Apr 4, 2021

KOLANICH Apr 4, 2021

ghost Apr 4, 2021

KOLANICH Apr 4, 2021

ghost Apr 4, 2021

KOLANICH Apr 4, 2021

ghost Apr 6, 2021

KOLANICH Apr 6, 2021

ghost Apr 6, 2021

KOLANICH Apr 6, 2021

KOLANICH Apr 4, 2021 •

edited

Loading

ghost Apr 4, 2021

generalmimon Apr 6, 2021

ghost Apr 7, 2021

armijnhemel Jan 4, 2022

armijnhemel Jan 4, 2022

KOLANICH Jan 4, 2022

ghost Jan 5, 2022

armijnhemel Jan 5, 2022

armijnhemel Jan 5, 2022

armijnhemel Jan 5, 2022 •

edited

Loading

armijnhemel Jan 5, 2022

armijnhemel Jan 5, 2022

	- id: specific
	type:
	switch-on: signature
	cases:
	'signature::aaip_attribute_list': susp_unknown # AL
	'signature::rras_amiga_specific': rras_as # AS
	'signature::susp_continuation_area': susp_ce # CE
	'signature::rrip_child_link': rrip_cl # CL
	'signature::susp_extensions_reference': susp_er # ER
	'signature::susp_extension_selector': susp_es # ES
	'signature::rrip_alternate_name': rrip_nm # NM
	'signature::susp_padding_field': susp_pd # PD
	'signature::rrip_parent_link': rrip_pl # PL
	'signature::rrip_posix_device_number': rrip_pn # PN
	'signature::rrip_posix_file_attributes': rrip_px # PX
	'signature::rrip_relocated_directory': rrip_re # RE
	'signature::rrip_extensions_in_use_indicator': susp_unknown # RR
	'signature::rrip_sparse_file': rrip_sf # SF
	'signature::rrip_symbolic_link': susp_sl # SL
	'signature::susp_indicator': susp_sp # SP
	'signature::susp_terminator': susp_st # ST
	'signature::rrip_time_file': rrip_tf # TF
	'signature::rrzf_zisofs': rrzf_zf # ZF

	susp_unknown: # default for now
	seq:
	- id: length
	type: u1
	- id: version
	type: u1
	- id: data

	rras_as:
	doc-ref: rras
	seq:
	- id: length
	type: u1
	- id: version
	type: u1
	valid: 1
	- id: reserved

	susp_ce:
	doc-ref: susp 5.1
	seq:
	- id: length
	type: u1
	- id: version
	type: u1
	valid: 1
	- id: ca_location

	header:
	seq:
	- id: signature
	type: u2be
	enum: signature
	- id: specific

	susp_pd:
	doc-ref: susp 5.2
	seq:
	- id: length
	type: u1
	- id: version
	type: u1
	valid: 1
	- id: padding_area
	size: length - 4

Iso9660 #82

Iso9660 #82

Conversation

ghost commented Feb 3, 2018

KOLANICH Feb 4, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

KOLANICH Feb 4, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

KOLANICH Feb 4, 2018 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

KOLANICH commented Apr 4, 2021

KOLANICH Apr 4, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

KOLANICH Apr 4, 2021 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

armijnhemel Jan 5, 2022 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

KOLANICH Feb 4, 2018 •

edited

Loading

KOLANICH Feb 4, 2018 •

edited

Loading

KOLANICH Feb 4, 2018 •

edited

Loading

KOLANICH Apr 4, 2021 •

edited

Loading

KOLANICH Apr 4, 2021 •

edited

Loading

armijnhemel Jan 5, 2022 •

edited

Loading