Why does an expression like `(!"foo" .)*` generate arrays of `[undefined, char]`-values in PEG.js

Question

I'm still pretty new to PEG.js, and I'm guessing this is just a beginner misunderstanding. In trying to parse something like this: I can get a grammar to properly read the three sections (to be further parsed later, of course). But it generates that text in an odd format. For instance, in the above, "some text" turns into I can

Accepted Answer

- A negative lookahead, e.g. `!Rule`, always returns `undefined`; it fails if `Rule` matches.
- The dot `.` matches any single character.
- A sequence `Rule1 Rule2 ...` creates a list with the result of each rule.
- A repetition `Rule+` or `Rule*` matches `Rule` as many times as possible and creates a list of the results. (`+` fails if the first attempt to match `Rule` fails.)

Your results are:

    [ // Start of (!"\nif" .)*
      [ undefined, // first !"\nif"
        "s"        // first .
      ],           // first (!"\nif" .)
      [undefined, "o"], // second (!"\nif" .)
      [undefined, "m"], [undefined, "e"], [undefined, " "],
      [undefined, "t"], [undefined, "e"], [undefined, "x"], [undefined, "t"]
    ] // This list is (!"\nif" .)*, all the matches of (!"\nif" .)

What you seem to want is to read the text instead. You can use the operator `$Rule` for this: it returns the matched input text instead of the output the rule produced.

    MainObject
      = _ defs:DefSection _ condition:CondSection _ consequent:ConsequentSection
        { return {defs, condition, consequent} }

    DefSection = _ "definitions"i _ defs:$(!"\nif" .)+
      { return defs.trim() }

    CondSection = _ "if"i _ cond:$(!"\nthen" .)+
      { return cond.trim() }

    ConsequentSection = _ "then"i _ cons:$(.*)
      { return cons.trim() }

    _ "whitespace"
      = [ \t\n\r]*

This will produce:

    {
       "defs": "some text",
       "condition": "some additonal text    to parse here",
       "consequent": "still more text will    go here"
    }
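To see why each entry comes out as `[undefined, char]`, and what `$` changes, here is a rough toy model of the four operators in plain JavaScript. It is only a sketch for illustration and not how PEG.js is implemented: the helper names `lit`, `any`, `not`, `seq`, `star` and `text` are made up for this example, and the sample input string is an assumption.

```javascript
// Toy model of a few PEG operators. NOT PEG.js internals; all helper names are hypothetical.
// A parser is a function (input, pos) that returns { value, pos } on success or null on failure.
const FAIL = null;

// "literal": matches the exact text at the current position.
const lit = (s) => (input, pos) =>
  input.startsWith(s, pos) ? { value: s, pos: pos + s.length } : FAIL;

// . : matches any single character; fails only at end of input.
const any = (input, pos) =>
  pos < input.length ? { value: input[pos], pos: pos + 1 } : FAIL;

// !p : negative lookahead; consumes nothing and yields undefined when p fails.
const not = (p) => (input, pos) =>
  p(input, pos) === FAIL ? { value: undefined, pos } : FAIL;

// p1 p2 ... : sequence; yields an array with one result per sub-parser.
const seq = (...parsers) => (input, pos) => {
  const values = [];
  for (const p of parsers) {
    const r = p(input, pos);
    if (r === FAIL) return FAIL;
    values.push(r.value);
    pos = r.pos;
  }
  return { value: values, pos };
};

// p* : repeats p as often as possible; yields an array of the results.
const star = (p) => (input, pos) => {
  const values = [];
  for (;;) {
    const r = p(input, pos);
    // The second check is a safety guard of this toy model against non-consuming parsers.
    if (r === FAIL || r.pos === pos) break;
    values.push(r.value);
    pos = r.pos;
  }
  return { value: values, pos };
};

// $p : discards p's result and yields the matched slice of the input instead.
const text = (p) => (input, pos) => {
  const r = p(input, pos);
  return r === FAIL ? FAIL : { value: input.slice(pos, r.pos), pos: r.pos };
};

// Assumed sample input: a section body followed by a line starting with "if".
const input = "some text\nif rest";
const untilIf = star(seq(not(lit("\nif")), any)); // models (!"\nif" .)*

const pairs = untilIf(input, 0).value;            // one [undefined, ch] pair per character
const asText = text(untilIf)(input, 0).value;     // models $(!"\nif" .)*
const joined = pairs.map(([, ch]) => ch).join(""); // manual alternative to $

console.log(pairs.length); // 9
console.log(asText);       // some text
console.log(joined);       // some text
```

The `joined` line shows what you would otherwise have to do by hand in a rule's action: drop the `undefined` from each pair and join the characters. The `$` operator avoids that step by returning the consumed text directly.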
