[thelist] Regex Help
Noah St. Amand
noah at tookish.net
Sat Feb 4 00:57:04 CST 2006
Hi Volkan,
VOLKAN ÖZÇELI.K wrote (2/4/06 1:38 AM):
> Can you give some matching and non-matching examples so that we may be
> clear on what to match and what not to match. To me, your specs seem
> too wide to be understood by just reading.
Sure -- thanks for the advice. This is going to be a bit long, though.
Given this string:
-----
Lorem ipsum dolor sit amet, congue congue hendrerit et nam sit. Magna
etiam purus, tincidunt mi mauris fringilla feugiat tristique velit,
metus at nec mollis lorem vehicula consequatur, volutpat lacinia quod,
consectetuer amet mauris eu erat sollicitudin. Vivamus pharetra
suspendisse vestibulum.
-----
I would want to extract:
-----
Lorem ipsum dolor sit amet, congue congue hendrerit et nam sit.
-----
Given:
-----
Lorem ipsum dolor sit amet, congue congue hendrerit et nam sit.
Magna etiam purus, tincidunt mi mauris fringilla feugiat tristique
velit, metus at nec mollis lorem vehicula consequatur, volutpat lacinia
quod, consectetuer amet mauris eu erat sollicitudin. Vivamus pharetra
suspendisse vestibulum.
-----
I would want:
-----
Lorem ipsum dolor sit amet, congue congue hendrerit et nam sit.
-----
Given:
-----
Lorem ipsum dolor sit <a href="http://www.example.com/>amet</a>, congue
congue hendrerit et nam sit. Magna etiam purus, tincidunt mi mauris
fringilla feugiat tristique velit, metus at nec mollis lorem vehicula
consequatur, volutpat lacinia quod, consectetuer amet mauris eu erat
sollicitudin. Vivamus pharetra suspendisse vestibulum.
-----
I would want:
-----
Lorem ipsum dolor sit <a href="http://www.example.com/>amet</a>, congue
congue hendrerit et nam sit.
-----
There are a few other possibilities, but I think this covers the areas
that are giving me problems. Specifically, it's the last one, working
around the periods in the link. The regex I have so far is:
"/^[^\.]*\.\s/"
Thanks again,
Noah
More information about the thelist
mailing list