Home

Welcome to my personal wiki space. Mainly it organizes Terence's blog

Example tree rewriting with patterns
Last changed Nov 30, 2008 12:57 by Terence Parr

So, let's do some rewriting using the pattern matching filter=true mode. Again, the VecMath.g parser will build trees but we'll avoid building an entire tree grammar. We'll focus on some patterns we want to rewrite.
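To make the idea of rewriting only selected patterns concrete, here is a minimal Python sketch rather than ANTLR code. It assumes trees are nested tuples of the form `(token, child, ...)`. The two rules (multiply by zero, and distributing a scalar over a `VEC` node) and all names are hypothetical illustrations, not taken from VecMath.g.

```python
def rewrite(tree, rules):
    """Rewrite bottom-up: children first, then try each rule at this node."""
    if not isinstance(tree, tuple):
        return tree  # leaf (number or identifier): nothing to rewrite
    tree = (tree[0],) + tuple(rewrite(child, rules) for child in tree[1:])
    for rule in rules:
        replacement = rule(tree)
        if replacement is not None:
            # A rule fired; the new subtree may expose further matches.
            return rewrite(replacement, rules)
    return tree  # no pattern matched: keep the subtree unchanged


def mult_by_zero(t):
    """('*', 0, x) or ('*', x, 0) -> 0"""
    if t[0] == "*" and len(t) == 3 and 0 in (t[1], t[2]):
        return 0
    return None


def scalar_times_vector(t):
    """('*', s, ('VEC', e1, e2, ...)) -> ('VEC', ('*', s, e1), ('*', s, e2), ...)"""
    if t[0] == "*" and len(t) == 3 and isinstance(t[2], tuple) and t[2][0] == "VEC":
        return ("VEC",) + tuple(("*", t[1], e) for e in t[2][1:])
    return None


RULES = [mult_by_zero, scalar_times_vector]
```

Subtrees that match no rule pass through untouched, which is the point: only the patterns of interest need to be written down.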

Here's the grammar to build trees. …

Read more…

Posted at Nov 30, 2008 by Terence Parr | 0 comments
Woohoo! Tree pattern matching, rewriting a reality
Last changed Nov 30, 2008 11:00 by Terence Parr

Can't resist showing off the new filter mode for tree grammars (this is working in my dev branch). Imagine I built some trees with Cymbal.g and want to define symbols and push/pop scopes. Previously you had to give a full tree grammar even though we'd only have actions in a few spots and don't care about structure (we trust the tree builder). By doing tree pattern matching, we get to focus only on those subtrees we care about. …
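A rough Python sketch of the same idea, independent of ANTLR: walk every node, try a few patterns on the way down and on the way up, and ignore everything else. The tuple tree shape and the node names `BLOCK` and `VARDEF` are assumptions made for illustration; they are not taken from Cymbal.g.

```python
def visit(tree, down_rules, up_rules):
    """Try every down rule when entering a node and every up rule when leaving it."""
    for rule in down_rules:
        rule(tree)
    for child in tree[1:]:
        if isinstance(child, tuple):  # plain strings are leaves
            visit(child, down_rules, up_rules)
    for rule in up_rules:
        rule(tree)


def collect_definitions(tree):
    """Return (name, scope_depth) for each VARDEF, pushing/popping a scope per BLOCK."""
    scopes = [set()]  # scope stack; index 0 is the global scope
    found = []

    def enter_block(t):
        if t[0] == "BLOCK":
            scopes.append(set())

    def define_var(t):
        if t[0] == "VARDEF":
            scopes[-1].add(t[1])
            found.append((t[1], len(scopes) - 1))

    def exit_block(t):
        if t[0] == "BLOCK":
            scopes.pop()

    visit(tree, [enter_block, define_var], [exit_block])
    return found
```

Nodes such as assignments or expressions never appear in any rule, yet the walk still passes through them to reach nested blocks.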

Read more…

Posted at Nov 29, 2008 by Terence Parr | 0 comments
Implementing tree pattern matching
Last changed Nov 22, 2008 15:23 by Terence Parr
{down, up}, apply={once,repeat}
Posted at Nov 22, 2008 by Terence Parr | 0 comments
tree pattern matching grammars
Last changed Oct 23, 2008 15:08 by Terence Parr

Filter mode for Lexers

So, filter mode for Lexers is incredibly useful. For example, I use it for my wiki to HTML translation. You just specify some rules in the lexer with filter=true and it tries all of the rules against the input stream. Precedence is given to rules specified first. In other words, the lexer tries to match the first rule against the current input location. If it fails, it moves to the next rule and tries it. It tries rules until it finds one that matches or it fails. …
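The rule-ordering behavior can be sketched in a few lines of Python. This is only a model of the mechanism described above, not ANTLR's implementation: the `BOLD` and `ITALIC` rules are hypothetical, and skipping a single character when every rule fails is an assumption about what happens next.

```python
import re

# Hypothetical wiki-markup rules; list order encodes precedence.
RULES = [
    ("BOLD", re.compile(r"\*[^*\n]+\*")),
    ("ITALIC", re.compile(r"_[^_\n]+_")),
]


def filter_lex(text, rules=RULES):
    """Return (rule_name, matched_text) pairs; input that no rule matches is skipped."""
    tokens = []
    pos = 0
    while pos < len(text):
        for name, pattern in rules:
            m = pattern.match(text, pos)  # anchored at the current input location
            if m and m.end() > pos:
                tokens.append((name, m.group(0)))
                pos = m.end()
                break  # first matching rule wins
        else:
            pos += 1  # every rule failed here: skip one character and retry
    return tokens
```

Because the rules are tried in list order, an earlier rule wins even when a later rule would have matched more text at the same position.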

Read more…

Posted at Oct 23, 2008 by Terence Parr | 1 comment
Increasing LL prediction strength

Spent hours and hours on airplanes recently bouncing around Europe giving talks. Had some time to think about increasing the recognition strength of LL by upgrading the prediction mechanism from a DFA to a pushdown machine, either LL or LR. I've scanned my notes and diagrams; see the attachments on this news item. I tried all sorts of things trying to approximate non-regular lookahead languages but didn't really come up with anything great. One thing of note, …

Read more…

Posted at Jun 26, 2008 by Terence Parr | 0 comments
Revisiting Smalltalk language
Last changed Jun 11, 2008 01:34 by Terence Parr

As Java gets more and more complicated with generics, closures, etc., I keep looking for simplicity. I have the Mantra prototype, but even that subset is kind of complicated. I decided to look at Smalltalk again. Coincidentally, Nik Boyd, who built Bistro years ago, contacted me; he's moved to the SF Bay area, which means we can chat in person. …

Read more…

Posted at Jun 10, 2008 by Terence Parr | 0 comments
Context-sensitive error handling
Last changed Jun 06, 2008 13:35 by Terence Parr

The default error handling mechanism that does single token insertion and deletion works really well in many cases. The one area where I think ANTLR needs some work is during no viable alternative exceptions and loops around rule references. For example, what happens if you have a fragment like:

d : decl+ ;
decl: 'foo'
    | 'bar'
    ;

If you have input ")) foo bar foo", which is valid except for the crazy "))" at the front, …

Read more…

Posted at Jun 06, 2008 by Terence Parr | 0 comments
Error recovery random thoughts
Last changed May 09, 2008 16:50 by Terence Parr

Currently syntax errors cause invalid trees and possibly even runtime exceptions when building ASTs. What we really need I believe is to have rules that encounter syntax errors return an ERROR node of some sort that records where the error occurred and, with luck, the tokens consumed during recovery. I started an improvement request:

http://www.antlr.org:8888/browse/ANTLR-193

The basic idea is that ERROR nodes get used in place of ASTs that would normally be produced by rule invocations. …
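As a rough illustration of a rule returning an ERROR node instead of a broken tree, here is a small hand-written Python parser. Everything in it is an assumption made for the sketch: the toy rule `decl : 'int' ID ';'`, the `ErrorNode` fields, and the choice to recover by consuming through the next `;`. It is not how ANTLR implements or will implement this.

```python
from dataclasses import dataclass, field


@dataclass
class ErrorNode:
    start: int                 # token index where the failing rule began
    stop: int                  # index of the last token consumed during recovery
    skipped: list = field(default_factory=list)  # tokens consumed while recovering
    message: str = ""


def expect(tokens, pos, text):
    """Match one literal token and return the next position, or raise SyntaxError."""
    if pos < len(tokens) and tokens[pos] == text:
        return pos + 1
    found = tokens[pos] if pos < len(tokens) else "<EOF>"
    raise SyntaxError(f"expected {text!r}, found {found!r}")


def parse_decl(tokens, pos):
    """decl : 'int' ID ';'  ->  ('DECL', name), or an ErrorNode on a syntax error."""
    start = pos
    try:
        pos = expect(tokens, pos, "int")
        if pos >= len(tokens) or not tokens[pos].isidentifier():
            raise SyntaxError("expected identifier")
        name = tokens[pos]
        pos = expect(tokens, pos + 1, ";")
        return ("DECL", name), pos
    except SyntaxError as err:
        # Recover by consuming through the next ';' (or to the end of input).
        stop = start
        while stop < len(tokens) and tokens[stop] != ";":
            stop += 1
        stop = min(stop, len(tokens) - 1)
        return ErrorNode(start, stop, tokens[start:stop + 1], str(err)), stop + 1


def parse_decls(tokens):
    """decls : decl* ; every element is either a DECL tuple or an ErrorNode."""
    nodes, pos = [], 0
    while pos < len(tokens):
        node, pos = parse_decl(tokens, pos)
        nodes.append(node)
    return nodes
```

The resulting list keeps its shape even when one declaration is malformed, so later phases can skip or report the ERROR node rather than crash on a partial tree.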

Read more…

Posted at May 09, 2008 by Terence Parr | 0 comments
Rewrite rules
Last changed May 03, 2008 10:43 by Terence Parr

Just had some cool ideas about a semantic rule specification language... I should write that up too... (also think about multi-threaded rewriting; no side-effects required)

Some language translation problems can be described with a few rewrite rules that are predicated upon their context and potentially an arbitrary Boolean expression. For example, consider a few identity transformations such as


expr: // in context of expr, match left side, …

Read more…

Posted at Apr 11, 2008 by Terence Parr | 0 comments
Automatic StringTemplate construction in ANTLR grammars
Last changed Apr 11, 2008 11:37 by Terence Parr

Currently ANTLR does not create templates for you automatically when you use the output=template option. This is because, when I first implemented it, I had no idea what the right answer was here. I did not know how to deal with whitespace and so on. I think I have the answer now. First, let me remind you that output=AST builds a completely flat tree given no instructions to the contrary. Similarly, the template output should reproduce the input given no instructions.

Read more…

Posted at Apr 11, 2008 by Terence Parr | 0 comments
Still more about expression parsing
Last changed Jun 19, 2008 07:09 by Terence Parr

Ok, Kay Roepke is in town and we've been discussing the faster expression parsing, among other things. Look for another entry on default StringTemplate generation for parsing and tree parsing.

Found a ref to Keith Clarke's original recursive-descent precedence work

ANTLR v3.2 will allow special rules for specifying expressions that are particularly efficient both in speed and space. …

Read more…

Posted at Apr 10, 2008 by Terence Parr | 0 comments
Faster expression parsing for ANTLR
Last changed Apr 08, 2008 15:23 by Terence Parr

I should be working on something else but got to thinking about how annoying it is specifying expressions in recursive descent parsers. You have to have a new rule for each precedence level. This is also very slow. Just to match 34 it has to descend about 15 method calls. I built a prototype single-rule (plus primary and suffix) operator matching thingie which I enclose below. I should be able to generate it from some metameta syntax in ANTLR. For example, …
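For readers who have not seen the single-rule approach, here is a minimal precedence-climbing sketch in Python. It is not the prototype referred to above, and it leaves out suffix operators; the operator table, the tuple tree shape, and the function names are assumptions chosen for illustration.

```python
import re

# Hypothetical operator table: symbol -> (precedence, right_associative)
BINARY_OPS = {
    "+": (1, False), "-": (1, False),
    "*": (2, False), "/": (2, False),
    "^": (3, True),
}


def tokenize(text):
    """Split into integer literals, operators and parentheses."""
    return re.findall(r"\d+|[-+*/^()]", text)


def parse_expr(tokens, pos=0, min_prec=1):
    """One rule for all binary operators; returns (tree, next_pos)."""
    left, pos = parse_primary(tokens, pos)
    while pos < len(tokens) and tokens[pos] in BINARY_OPS:
        op = tokens[pos]
        prec, right_assoc = BINARY_OPS[op]
        if prec < min_prec:
            break  # operator binds too loosely; let the caller handle it
        # Left-associative operators require a strictly tighter right operand.
        next_min = prec if right_assoc else prec + 1
        right, pos = parse_expr(tokens, pos + 1, next_min)
        left = (op, left, right)
    return left, pos


def parse_primary(tokens, pos):
    """primary : INT | '(' expr ')'"""
    if pos >= len(tokens):
        raise SyntaxError("unexpected end of input")
    tok = tokens[pos]
    if tok == "(":
        tree, pos = parse_expr(tokens, pos + 1, 1)
        if pos >= len(tokens) or tokens[pos] != ")":
            raise SyntaxError("expected ')'")
        return tree, pos + 1
    if tok.isdigit():
        return int(tok), pos + 1
    raise SyntaxError(f"unexpected token {tok!r}")
```

In this sketch, matching a lone `34` costs one call to `parse_expr` and one to `parse_primary`, no matter how many precedence levels the table defines.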

Read more…

Posted at Mar 23, 2008 by Terence Parr | 2 comments
Mantra pipe speed improvement
Last changed Oct 11, 2007 12:22 by Terence Parr

I replaced the blocking queue sitting between pipeline actors (threaded consumers/producers) with my new ping-pong buffered version, and the running time of my word freq program dropped from 5.2s to 3.5s on a 5M file. That is close to the 3.0s achieved by the nonthreaded version.

// threaded, pipeline version; 3.5s (from 5.2s) on 5M file
f => Words() => { string w | ... }

// nonthreaded, nested map operation on big list; 3.0s
f. …
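For context, a ping-pong buffer lets the producer fill one buffer while the consumer drains the other, so the two threads synchronize once per batch instead of once per item. The Python sketch below shows that general technique only. It is not Mantra's implementation; the class name, the capacity, and the single-producer/single-consumer restriction are all assumptions.

```python
import threading


class PingPongBuffer:
    """Two-buffer hand-off between exactly one producer and one consumer."""

    def __init__(self, capacity=1024):
        self._capacity = capacity
        self._fill = []          # touched only by the producer thread
        self._ready = None       # full buffer being drained by the consumer
        self._closed = False
        self._cond = threading.Condition()

    def put(self, item):
        """Producer side: append without locking; synchronize only when full."""
        self._fill.append(item)
        if len(self._fill) >= self._capacity:
            self._hand_off()

    def _hand_off(self):
        with self._cond:
            while self._ready is not None:   # consumer is still draining
                self._cond.wait()
            self._ready = self._fill
            self._cond.notify_all()
        self._fill = []

    def close(self):
        """Producer side: flush the partial buffer and signal end of stream."""
        if self._fill:
            self._hand_off()
        with self._cond:
            self._closed = True
            self._cond.notify_all()

    def __iter__(self):
        """Consumer side: yield items batch by batch until the buffer is closed."""
        while True:
            with self._cond:
                while self._ready is None and not self._closed:
                    self._cond.wait()
                if self._ready is None:
                    return               # closed and fully drained
                batch = self._ready
            yield from batch             # drain without holding the lock
            with self._cond:
                self._ready = None       # free the slot so the producer can swap
                self._cond.notify_all()
```

A consumer that stops iterating partway through a batch would leave the producer blocked in this sketch, so a real version would need a cancellation path.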

Read more…

Posted at Oct 11, 2007 by Terence Parr | 2 comments
Mixins part deux

So I am narrowing down my understanding of how to use mixins. The Comparable example is awesome, but does not need fields. In my effort to reduce the size of the average Mantra object, I needed to remove the in and out fields from every object. They should be moved to an element that all of the actors can include or inherit from. I did not want to force all actors to inherit from the same base class, so I decided that a mixin was perfect but mixins don't currently allow fields. …

Read more…

Posted at Oct 08, 2007 by Terence Parr | 0 comments
Mantra 1.0a1 released

Finally got something ready for people to look at. Main components are in.

http://www.linguamantra.org/

Posted at Oct 05, 2007 by Terence Parr | 2 comments