[thelist] Regex

Daniel J. Cody djc at members.evolt.org
Thu Feb 28 16:12:01 CST 2002


Hey Burhan -

This is from a sed script I have, but it should work for you:

<[^>]*>
<[^<]*>

'[^>]' specifies a non-'>' character, and the '*' after it completes
the expression to mean zero or more non-'>' characters.

The second pattern does the same with '<' as the excluded character.
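
If it helps to see the first pattern in action, here is a rough sketch in
Python rather than sed (that switch is an assumption, use whatever your
tool is). The names TAG, find_tags, and strip_tags are made up for
illustration only.

```python
import re

# Pattern from above: '<', then zero or more non-'>' characters, then '>'.
TAG = re.compile(r"<[^>]*>")


def find_tags(text):
    """Return every substring that looks like a tag, in order."""
    return TAG.findall(text)


def strip_tags(text):
    """Delete every match, roughly what sed 's/<[^>]*>//g' would do."""
    return TAG.sub("", text)


sample = '<HTML><a href="x.html">link</a> and <HtML>'
print(find_tags(sample))   # ['<HTML>', '<a href="x.html">', '</a>', '<HtML>']
print(strip_tags(sample))  # 'link and '
```

Note that this grabs anything between angle brackets, including an empty
'<>', so it does not check that the tag name is valid HTML.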

hth!

.djc.

Burhan Khalid wrote:
> Hey List :
>
>         Need some help here. Trying to figure out a regex pattern for all
> valid html tags. Basically, it should grab anything like
> <HTML> or <html> or <HtML> or <a> or <A>. So far, not having much luck.
> Here is what I've got :
>
> (<\/?\w+)(>/\s+)
>
> Probably not even close.
>
> Thanks,
> Burhan
>




