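using System.Collections;
using System.Text.RegularExpressions;

// Returns the value of every href attribute found in txtIn.
protected ArrayList getURL(string txtIn)
{
    ArrayList outURL = new ArrayList();
    // Matches href="..." (quoted) or href=value (unquoted, up to whitespace or '>').
    Regex r = new Regex("href\\s*=\\s*(?:\"(?<url>[^\"]*)\"|(?<url>[^\\s>\"]+))", RegexOptions.IgnoreCase);
    foreach (Match m1 in r.Matches(txtIn))
    {
        // Add only the named group; Groups[0] is the whole match, not the URL.
        outURL.Add(m1.Groups["url"].Value);
    }
    return outURL;
}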
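For trying the idea outside a page class, here is a minimal self-contained sketch. The class name HrefExtractor, the method name GetUrls, and the sample HTML string are made up for illustration and are not part of the original snippet. It returns a List<string> instead of an ArrayList and assumes href values are either double-quoted or unquoted.

```csharp
using System;
using System.Collections.Generic;
using System.Text.RegularExpressions;

// Hypothetical wrapper class; the name is not from the original snippet.
public static class HrefExtractor
{
    // Quoted form href="..." or unquoted form href=value (ends at whitespace or '>').
    private static readonly Regex HrefPattern = new Regex(
        "href\\s*=\\s*(?:\"(?<url>[^\"]*)\"|(?<url>[^\\s>\"]+))",
        RegexOptions.IgnoreCase);

    public static List<string> GetUrls(string html)
    {
        var urls = new List<string>();
        foreach (Match m in HrefPattern.Matches(html))
        {
            // Only the named group holds the URL itself.
            urls.Add(m.Groups["url"].Value);
        }
        return urls;
    }
}

public static class Program
{
    public static void Main()
    {
        // Sample input invented for illustration.
        string html = "<a href=\"http://example.com/a\">A</a> <a href=/b.html>B</a>";
        foreach (string url in HrefExtractor.GetUrls(html))
        {
            Console.WriteLine(url);
        }
    }
}
```

A regex like this is fine for quick scraping of simple markup. It does not handle single-quoted attributes, and for arbitrary real-world HTML a proper parser is likely the safer choice.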
Tags: c#, html, regex