Edi Reader does not respect X12 advised control characters #19

cleftheris · 2016-10-05T07:50:01Z

In all known EDI formats there are default control characters these are documented in the default EdiGrammar presets.

EdiGrammar.NewEdiFact()
EdiGrammar.NewX12()
EdiGrammar.NewTradacoms()

Now in the EdiFact and X12 cases there is a way for the transmitter (he who sends the edi transmission)
to advise differently for each of the control characters.

Control characters are considered to be the segment, element & component seperators plus the escape character.

In X12 the current implementation completely ignores them.

The text was updated successfully, but these errors were encountered:

cleftheris · 2016-10-05T08:34:42Z

@raholland79 A temp workaround until this is fixed would be to alter the default grammar with different presets:

public class CustomX12Grammar : EdiGrammar
{
    public CustomX12Grammar : base() 
    {
            _ComponentDataElementSeparator = '>',
            _DataElementSeparator = '*',
            _DecimalMark = '.',
            _ReleaseCharacter = null,
            _Reserved = new char[0],
            _SegmentTerminator = '~',
            _ServiceStringAdviceTag = null,
            _InterchangeHeaderTag = "ISA",
            _FunctionalGroupHeaderTag = "GS",
           _MessageHeaderTag = "ST",
            _MessageTrailerTag = "SE",
            _FunctionalGroupTrailerTag = "GE",
            _InterchangeTrailerTag = "IEA",
    }
}

raholland79 · 2016-10-05T12:11:22Z

So in the case of

ISA_00_ 00 _02_SCAC _ZZ_MGCTLYST 160726_0836_U_00400_000002356_0_T>

We have no segment terminator - How do I represent that?

Also using the example given, on every file I try, I get this error

Invalid character after parsing segment name. Expected '*' but got: *. Path '', line 1, position 3.

cleftheris · 2016-10-05T12:21:10Z

sorry about that. My intention was to show how you could work around this by testing different character sets for the delimiters. Since it is not clear without the complete edi transmition what they are.

Consider the above as the default configuration.

I am guessing that a space is used as the DataElementSeparator = ' ' and the > as _SegmentTerminator = '>' for the ComponentDataElementSeparator I have no idea.

cleftheris · 2016-10-05T12:37:01Z

take a look this is what I found about what X12 990 looks like. Its a sample I found googling about it

ISA*00*          *00*          *12*4405197800     *01*999999999      *111219*1742*U*00400*000000003*0*P*>

cleftheris · 2016-10-05T13:26:17Z

Ok according to these guys here

EDI data is exchanged in text files. With this in mind, EDI standards recommendations do not result in a viewable file, and common industry practice produces corrupted files when collisions occur between data and delimiters.

this means that there is a great possibility that your delimiters are not visible.

According to the same link standards suggest you use the following control characters (some of them invisible). Try this:

public class CustomX12Grammar : EdiGrammar
{
    public CustomX12Grammar : base() 
    {
           _ComponentDataElementSeparator = '>'; // for 990  
           //_ComponentDataElementSeparator = ':'; // for your 214 
           _SegmentNameDelimiter = (char)Int16("001D", System.Globalization.NumberStyles.AllowHexSpecifier);
           _DataElementSeparator = (char)Int16("001D", System.Globalization.NumberStyles.AllowHexSpecifier);
           _SegmentTerminator = (char)Int16("001C", System.Globalization.NumberStyles.AllowHexSpecifier);
            _DecimalMark = '.',
            _ReleaseCharacter = null,
            _Reserved = new char[0],
            _ServiceStringAdviceTag = null,
            _InterchangeHeaderTag = "ISA",
            _FunctionalGroupHeaderTag = "GS",
           _MessageHeaderTag = "ST",
            _MessageTrailerTag = "SE",
            _FunctionalGroupTrailerTag = "GE",
            _InterchangeTrailerTag = "IEA",
    }
}

PS: You could try to see the Unicode char by reading the file to text and debugging in visual studio to make sure.

raholland79 · 2016-10-05T13:26:56Z

That matches mine exactly'

However, using the default config above for files that conform to the default configuration I still get:

Invalid character after parsing segment name. Expected '*' but got: *. Path '', line 1, position 3.

The code is the default as you've written above - and the 214 works fine with the default NewX12()

But using the custom grammar gives the expect * but got * error.


var customGrammar = new CustomX12Grammar();
using (var stream = new StreamReader(@"C:\sandbox\drdispatch\EDI\assets\214-MGCTLYST-SAMPLE.EDI"))
    var interchange = new EdiSerializer().Deserialize<Mg214Interchange>(stream, customGrammar);

raholland79 · 2016-10-12T15:02:24Z

I can't get it working. I may have to try Edi Fabric - but would rather not. I have a customer wanting this done NOW though.

cleftheris · 2016-10-12T16:07:01Z

@raholland79 I can see if I can make this work but I need you to send me the original sample file (not tampered with), If this is sensitive information involved send me via email here: c.leftheris at live.com . I need the originals because any file edited with a text editor may have stripped out the invisible control characters we are searching for.

I have been working hard to release a first version that includes the Serialization (writing back to EDI) for a while now. Not an easy task since this is an opensource project I work on my free time. That said it just came out yesterday v1.1 and its out of the way.

raholland79 · 2016-10-12T16:43:33Z

Oh I know you're working hard - And I appreciate it immensely

I'd be willing to toss some cash your way for your hard work.

I've attached a zip with all 4 files.

If I remember right only the 214 is working out of the box.

EdiSampleFiles.zip

raholland79 · 2016-10-12T16:52:12Z

204 and 990 are top priority

cleftheris · 2016-10-12T18:44:48Z

@raholland79 Ok the issue here is that the new line \n is used as the special character for Segment Termination. This is considered as whitespace throughout the EdiTextReader and will probably take a day or two to figure out. In the meantime you could replace \n with another valid character like ~ before feeding the stream to Edi.Net and everything should work fine (provided you have the rest of the seperators setup correctly).

raholland79 · 2016-10-12T21:27:37Z

So basically replace the trailing \n on the ISA line in code with a ~ and process from there?

cleftheris · 2016-10-12T22:31:49Z

No you must replace every occurrence of the \n character with ~.
This will do the trick until this is fixed,

        [Fact]
        public void X12_204_Test() {
            var grammar = EdiGrammar.NewX12();
            grammar.SetAdvice(
                segmentNameDelimiter: '*', 
                dataElementSeparator: '*', 
                componentDataElementSeparator: ':', 
                segmentTerminator: '~', 
                releaseCharacter: null, 
                reserved: null, 
                decimalMark: '.');

            string text = File.ReadAllText(@"C:\sandbox\drdispatch\EDI\assets\204-MGCTLYST-SAMPLE.EDI");
            var segmentCount = 0;
            using (var stream = new MemoryStream(Encoding.UTF8.GetBytes(text.Replace('\n', '~')))) {
                interchange = new EdiSerializer().Deserialize<Models.Transportation_204>(new StreamReader(stream), grammar);
            }
        }

this reads the whole file into a string. Then replaces the character and at last it feeds the result to the EdiSerializer.

related to #19 and #24

raholland79 · 2016-10-14T13:00:22Z

My idiot vendor now says the terminator should be ~ on all files - I'm reconfirming to be sure - but If so this makes the library work as is without any work around and you can backlog this particular bug for more important work.

I'll let you know ASAP.

Again, thanks for the wonderful hard work and excellent communication.

raholland79 · 2016-10-18T01:57:47Z

They're good with ~ so - while this may be a bug I don't need it handled.

cleftheris · 2016-10-18T07:31:05Z

@raholland79 glad you made it work. Issue #24 is still one of my top priorities.

Thanks.

PGP-Protector · 2017-03-30T18:38:28Z

I did find a way to solve this issue (Not nice ,but works)
Was having an issue with some FedEx X12 214 Files.
The EdiGrammar.NewX12(); would default to ISA16 = '~' only
And wouldn't actually read the Character in that field '!' (Why???)
Each time it would complain that there's too many characters.

My solution ? Not nice but solves the
Invalid character after parsing segment name. Expected '*' but got: *. Path '', line 1, position 3.
In using the above example code.
Created a New Class Grammer:IEdiGrammer and just used the original EDIGrammer code as a base, but with the values I needed. (Dropped the unused sections also)
FedExX12214.zip

File attached shows the method I did.

Better solution would be to allow proper reading of the ISA16 field though.

cleftheris · 2017-03-31T09:02:30Z

Hi @PGP-Protector,

Unfortunately this is still an issue. The main reason is that I did not find the time. Changing the code to be able to treat \n (#24) was not trivial and that has stalled this issue as well.

Regarding your workaround, it seems fine to me. The only improvement (suggestion) would be to use the SetAdvice method that can override the characters without you implementing the whole thing.

var grammar = EdiGrammar.NewX12();
grammar.SetAdvice(
                segmentNameDelimiter: '*', 
                dataElementSeparator: '*', 
                componentDataElementSeparator: ':', 
                segmentTerminator: '!', 
                releaseCharacter: null, 
                reserved: null, 
                decimalMark: '.');

…in EdiTextReader related to #19

cleftheris added the bug label Oct 5, 2016

cleftheris mentioned this issue Oct 5, 2016

Trouble With Some X.12 #15

Closed

cleftheris added the X12 label Oct 12, 2016

cleftheris added enhancement and removed bug labels Oct 13, 2016

cleftheris mentioned this issue Oct 13, 2016

Cannot use '\n' the line feed as a Segment seperator #24

Closed

cleftheris added a commit that referenced this issue Oct 13, 2016

Added samples to test for the \n issue.

25c158d

related to #19 and #24

This was referenced Oct 4, 2017

Component Element Separator is always null #67

Open

Deserialize then reserialize of x12.850.edi has different result? #68

Closed

cleftheris mentioned this issue Apr 16, 2018

Stack empty InvalidOp exception parsing X12 #88

Closed

cleftheris mentioned this issue May 23, 2018

ISA/15 always null #92

Closed

cleftheris added a commit that referenced this issue Sep 18, 2018

support new line as a segment terminator in EdiTextReader closes #24 #19

98b44f3

cleftheris added a commit that referenced this issue Sep 21, 2018

Support windows style CRLF endings with \r as the segment terminator …

39af5cd

…in EdiTextReader related to #19

cleftheris mentioned this issue Jul 12, 2019

Generating x12 856 #133

Open

cleftheris mentioned this issue Jul 8, 2022

Trying out EDI.NET, working with different vendors #231

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Edi Reader does not respect X12 advised control characters #19

Edi Reader does not respect X12 advised control characters #19

cleftheris commented Oct 5, 2016 •

edited

Loading

cleftheris commented Oct 5, 2016 •

edited

Loading

raholland79 commented Oct 5, 2016

cleftheris commented Oct 5, 2016

cleftheris commented Oct 5, 2016

cleftheris commented Oct 5, 2016 •

edited

Loading

raholland79 commented Oct 5, 2016

raholland79 commented Oct 12, 2016

cleftheris commented Oct 12, 2016

raholland79 commented Oct 12, 2016

raholland79 commented Oct 12, 2016

cleftheris commented Oct 12, 2016

raholland79 commented Oct 12, 2016

cleftheris commented Oct 12, 2016 •

edited

Loading

raholland79 commented Oct 14, 2016

raholland79 commented Oct 18, 2016

cleftheris commented Oct 18, 2016

PGP-Protector commented Mar 30, 2017

cleftheris commented Mar 31, 2017 •

edited

Loading

Edi Reader does not respect X12 advised control characters #19

Edi Reader does not respect X12 advised control characters #19

Comments

cleftheris commented Oct 5, 2016 • edited Loading

cleftheris commented Oct 5, 2016 • edited Loading

raholland79 commented Oct 5, 2016

cleftheris commented Oct 5, 2016

cleftheris commented Oct 5, 2016

cleftheris commented Oct 5, 2016 • edited Loading

raholland79 commented Oct 5, 2016

raholland79 commented Oct 12, 2016

cleftheris commented Oct 12, 2016

raholland79 commented Oct 12, 2016

raholland79 commented Oct 12, 2016

cleftheris commented Oct 12, 2016

raholland79 commented Oct 12, 2016

cleftheris commented Oct 12, 2016 • edited Loading

raholland79 commented Oct 14, 2016

raholland79 commented Oct 18, 2016

cleftheris commented Oct 18, 2016

PGP-Protector commented Mar 30, 2017

cleftheris commented Mar 31, 2017 • edited Loading

cleftheris commented Oct 5, 2016 •

edited

Loading

cleftheris commented Oct 5, 2016 •

edited

Loading

cleftheris commented Oct 5, 2016 •

edited

Loading

cleftheris commented Oct 12, 2016 •

edited

Loading

cleftheris commented Mar 31, 2017 •

edited

Loading