I'm about to finish this thing. It's mostly modeled after Expat.
It's all Perl, and mine is faster, parsing about 1 meg a second.
It's also compliant with the current XML standards on w3.org.
There's so much to it that I don't think I want to post it here.
I would like to make it into a "free" module on CPAN or in ActiveState's
release version.
I think it's commercial level. The fact is I can "interject" special
searches and handling if I want to. It is designed using the specs
from here:
http://www.w3.org/TR/xml11/#NT-AttValue
That's version 1.1. If I'm using the wrong specs, please let me know.
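For reference, the AttValue production at that link can be sketched as a
Perl regex. This is only an illustration, not the parser's actual code.
The Name pattern is a simplified ASCII-only assumption (the spec allows
many more Unicode characters), and is_att_value is a hypothetical helper
name.

```perl
use strict;
use warnings;

# Simplified Name: ASCII-only subset of the spec's name characters (assumption).
my $Name      = qr/[A-Za-z_:][A-Za-z0-9_:.\-]*/;
# CharRef ::= '&#' [0-9]+ ';' | '&#x' [0-9a-fA-F]+ ';'
my $CharRef   = qr/&#[0-9]+;|&#x[0-9a-fA-F]+;/;
# EntityRef ::= '&' Name ';'
my $EntityRef = qr/&$Name;/;
my $Reference = qr/$EntityRef|$CharRef/;

# AttValue ::= '"' ([^<&"] | Reference)* '"' | "'" ([^<&'] | Reference)* "'"
my $AttValue = qr/"(?:[^<&"]|$Reference)*"|'(?:[^<&']|$Reference)*'/;

# Hypothetical helper: 1 if the whole string is a well-formed quoted
# attribute value under the simplified grammar above, 0 otherwise.
sub is_att_value {
    my ($s) = @_;
    return $s =~ /\A(?:$AttValue)\z/ ? 1 : 0;
}

for my $candidate ('"hello"', q{'a &amp; b'}, '"a < b"', '"a & b"') {
    printf "%-14s %s\n", $candidate, is_att_value($candidate) ? 'ok' : 'bad';
}
```

A literal "<" or a bare "&" inside the quotes is rejected, which is what
the production requires. Only entity and character references may start
with "&".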
It's awesome, tremendously fast.
I am also going to write a full-featured "schema checker" using this
base parser. I've never seen anything as easy as schema checking.
Thinking beyond that, I will move into modification tools. Even style sheet
mods (I think, it's all too easy now). I will do it all in markup.
The code is about 600 lines now. I could plop it down here. I have
all constructs covered in the above 1.1 specs. I'm worried a little
about encoding and Unicode. By and large, I've never seen anything
so easy in my life. I fear that my code is approaching a professional
level and I may "not" want to just plop it down here.
I may want to contact AS or CPAN to post the module so it's not ripped
off. However, I know I could do a schema checker in a week. Since it's
all so easy now, I'm wondering if I can make any money at this or is it
all just a give-away...
Oh well, from a homeless man to a middle class man, I know it won't be
that much. However, I have developed tools that could do conversions.
Yeah, sure, I want to put my stuff in the public domain, but what I do
internally with it could handle fast custom conversions.
What do you think? Say it now; once it ends up in AS or CPAN you won't have
the option to recommend. It will arrive there, but what's the money behind
hard core conversions, style, schema, filters, anything?