Re: grep is really pathetic sometimes

This WebDNA talk-list message is from

2003


It keeps the original formatting.
numero = 51318
interpreted = N
texte = Hi John, I tried that same code: >> code: [Text show=T]SKU=TROUTPRINT.JPG[/Text]

[grep search=^([A-Z0-9][0-9A-Z])([0-9A-Z][0-9A-Z][0-9A-Z])&replace=\1/\2/][getchar s start=1&end=5][SKU][/getchars][/grep] >> output: TROUTPRINT.JPG TR/OUT/ Raj ----- Original Message ----- From: "John Peacock" To: "WebDNA Talk" Sent: Wednesday, June 25, 2003 2:10 AM Subject: RFE: grep is really pathetic sometimes > > I love using regular expressions to process my text (which is why I spend most > of my time using Perl these days). However, one of my other developers came to > me to ask why a particular grep was working the way it was. > > The code is as follows: > > [grep > search=^([A-Z0-9][0-9A-Z])([0-9A-Z][0-9A-Z][0-9A-Z])&replace=\1/\2/][getchar s > start=1&end=5][SKU][/getchars][/grep] > > That *should* match the first two characters (as \1) and the next three (as \2). > It always seemed to work when we were only looking at numbers, but we had to > extend it to work with characters as well as and it suddenly started matching > twice for one item. > > For SKU=TROUTPRINT.JPG, we expected it to return "TR/OUT/", but it would return > "TR/OUT/PR/INT" instead. It turns out that [grep] doesn't understand the > beginning of line (BOL) anchor "^" character. Consequently, the pattern was > matching twice (because this item had a longer SKU than the other items). This > is very annoying and vastly reduces the usabililty of [grep] for me. > > Why couldn't WebCat provide a real regex engine like Perl Compatible Regular > Expressions: > > http://www.pcre.org/ > > instead of whatever hodgepodge of code it currently has? At the very least, > WebCat should have thrown some sort of an error for the "^" which it doesn't > understand. > > > John > > -- > John Peacock > Director of Information Research and Technology > Rowman & Littlefield Publishing Group > 4501 Forbes Boulevard > Suite H > Lanham, MD 20706 > 301-459-3366 x.5010 > fax 301-429-5748 > > > ------------------------------------------------------------- > This message is sent to you because you are subscribed to > the mailing list . > To unsubscribe, E-mail to: > To switch to the DIGEST mode, E-mail to > Web Archive of this list is at: http://webdna.smithmicro.com/ ------------------------------------------------------------- This message is sent to you because you are subscribed to the mailing list . To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://webdna.smithmicro.com/ Associated Messages, from the most recent to the oldest:

    
  1. Re: grep is really pathetic sometimes ( John Peacock 2003)
  2. Re: grep is really pathetic sometimes ( "Rajeev Kumar" 2003)
  3. Re: grep is really pathetic sometimes ( John Peacock 2003)
  4. Re: grep is really pathetic sometimes ( "Rajeev Kumar" 2003)
  5. RFE: grep is really pathetic sometimes ( John Peacock 2003)
Hi John, I tried that same code: >> code: [Text show=T]SKU=TROUTPRINT.JPG[/Text]

[grep search=^([A-Z0-9][0-9A-Z])([0-9A-Z][0-9A-Z][0-9A-Z])&replace=\1/\2/][getchar s start=1&end=5][SKU][/getchars][/grep] >> output: TROUTPRINT.JPG TR/OUT/ Raj ----- Original Message ----- From: "John Peacock" To: "WebDNA Talk" Sent: Wednesday, June 25, 2003 2:10 AM Subject: RFE: grep is really pathetic sometimes > > I love using regular expressions to process my text (which is why I spend most > of my time using Perl these days). However, one of my other developers came to > me to ask why a particular grep was working the way it was. > > The code is as follows: > > [grep > search=^([A-Z0-9][0-9A-Z])([0-9A-Z][0-9A-Z][0-9A-Z])&replace=\1/\2/][getchar s > start=1&end=5][SKU][/getchars][/grep] > > That *should* match the first two characters (as \1) and the next three (as \2). > It always seemed to work when we were only looking at numbers, but we had to > extend it to work with characters as well as and it suddenly started matching > twice for one item. > > For SKU=TROUTPRINT.JPG, we expected it to return "TR/OUT/", but it would return > "TR/OUT/PR/INT" instead. It turns out that [grep] doesn't understand the > beginning of line (BOL) anchor "^" character. Consequently, the pattern was > matching twice (because this item had a longer SKU than the other items). This > is very annoying and vastly reduces the usabililty of [grep] for me. > > Why couldn't WebCat provide a real regex engine like Perl Compatible Regular > Expressions: > > http://www.pcre.org/ > > instead of whatever hodgepodge of code it currently has? At the very least, > WebCat should have thrown some sort of an error for the "^" which it doesn't > understand. > > > John > > -- > John Peacock > Director of Information Research and Technology > Rowman & Littlefield Publishing Group > 4501 Forbes Boulevard > Suite H > Lanham, MD 20706 > 301-459-3366 x.5010 > fax 301-429-5748 > > > ------------------------------------------------------------- > This message is sent to you because you are subscribed to > the mailing list . > To unsubscribe, E-mail to: > To switch to the DIGEST mode, E-mail to > Web Archive of this list is at: http://webdna.smithmicro.com/ ------------------------------------------------------------- This message is sent to you because you are subscribed to the mailing list . To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://webdna.smithmicro.com/ "Rajeev Kumar"

DOWNLOAD WEBDNA NOW!

Top Articles:

Talk List

The WebDNA community talk-list is the best place to get some help: several hundred extremely proficient programmers with an excellent knowledge of WebDNA and an excellent spirit will deliver all the tips and tricks you can imagine...

Related Readings:

Problems with [Search] param - Mac Plugin b15 (1997) Purchase command error problem (1997) Removing [showif] makes a big difference in speed (1997) GuestBook example (1997) WebCat2b13MacPlugIn - more [date] problems (1997) Why isn't this working (1999) The max=0 issue is a bug ... CALL TO ACTION (2000) ShowNext Command (1997) Mail Server Needed. (1998) WebCat2 - Getting to the browser's username/password data (1997) 300 ticks (2003) OT: Development contracts (1999) AJAX with WebDNA (2006) Formulas.db + Users.db (1997) WebCatalog 3.0.4 alias crash bug? (2000) RE: Emailer prefs in two places ... (1997) OT: zipcodes (2002) hideif glitch (2003) Virtual hosting and webcatNT (1997) Showif for mulitple variations (1997)