Regain 2.1.0-STABLE API

net.sf.regain.crawler.config
Interface UrlMatcher

All Known Implementing Classes:
PrefixUrlMatcher, RegexUrlMatcher, UrlMatcherResult

public interface UrlMatcher

A matcher that checks whether URLs match a certain pattern.

Author:
Tilman Schneider, STZ-IDA an der FH Karlsruhe

Method Summary
 boolean getShouldBeIndexed()
          Gets the flag indicating whether the content should be indexed.
 boolean getShouldBeParsed()
          Gets the flag indicating whether links should be extracted from the content.
 boolean matches(String url)
          Checks whether a URL matches the rules of this matcher.
 void setShouldBeIndexed(boolean shouldBeIndexed)
          Sets the flag indicating whether the content should be indexed.
 void setShouldBeParsed(boolean shouldBeParsed)
          Sets the flag indicating whether links should be extracted from the content.
 

Method Detail

matches

boolean matches(String url)
Checks whether a URL matches the rules of this matcher.

Parameters:
url - The URL to check.
Returns:
Whether the given URL matches the rules of this matcher.
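
A minimal sketch of what an implementation of this interface could look like. The interface declared here is only a local stand-in that mirrors the five documented signatures so the example compiles without Regain on the classpath; SuffixUrlMatcher and SuffixUrlMatcherDemo are hypothetical names that do not exist in Regain, and the suffix rule is an assumption chosen for illustration.

```java
// Demo entry point (hypothetical name, not part of Regain).
class SuffixUrlMatcherDemo {
    public static void main(String[] args) {
        UrlMatcher matcher = new SuffixUrlMatcher(".pdf", false, true);
        System.out.println(matcher.matches("http://example.com/manual.pdf")); // prints true
        System.out.println(matcher.matches("http://example.com/index.html")); // prints false
    }
}

// Local stand-in mirroring the documented method signatures. Real code would
// import net.sf.regain.crawler.config.UrlMatcher instead of declaring this.
interface UrlMatcher {
    boolean matches(String url);
    boolean getShouldBeParsed();
    boolean getShouldBeIndexed();
    void setShouldBeParsed(boolean shouldBeParsed);
    void setShouldBeIndexed(boolean shouldBeIndexed);
}

// Hypothetical implementation: accepts every URL that ends with a fixed suffix.
class SuffixUrlMatcher implements UrlMatcher {
    private final String suffix;
    private boolean shouldBeParsed;
    private boolean shouldBeIndexed;

    SuffixUrlMatcher(String suffix, boolean shouldBeParsed, boolean shouldBeIndexed) {
        this.suffix = suffix;
        this.shouldBeParsed = shouldBeParsed;
        this.shouldBeIndexed = shouldBeIndexed;
    }

    @Override
    public boolean matches(String url) {
        // A null URL never matches; otherwise compare the tail of the URL.
        return url != null && url.endsWith(suffix);
    }

    @Override
    public boolean getShouldBeParsed() {
        return shouldBeParsed;
    }

    @Override
    public boolean getShouldBeIndexed() {
        return shouldBeIndexed;
    }

    @Override
    public void setShouldBeParsed(boolean shouldBeParsed) {
        this.shouldBeParsed = shouldBeParsed;
    }

    @Override
    public void setShouldBeIndexed(boolean shouldBeIndexed) {
        this.shouldBeIndexed = shouldBeIndexed;
    }
}
```

The matching rule and the two flags are kept independent in this sketch: matches(url) only answers whether the rule applies, while the flags describe what a caller may do with a URL that was accepted.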

getShouldBeParsed

boolean getShouldBeParsed()
Gets the flag indicating whether links should be extracted from the content.

Returns:
Whether links contained in the content of a URL accepted by matches(url) should be extracted.

getShouldBeIndexed

boolean getShouldBeIndexed()
Gets the flag indicating whether the content should be indexed.

Returns:
Whether the content of a URL accepted by matches(url) should be indexed.
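
The two getters only make sense for a URL that matches(url) has accepted. The sketch below shows one way a caller might combine the three methods: it walks a list of matchers, takes the first one that accepts the URL, and reads its flags. This is an illustration under assumptions, not a description of how the Regain crawler itself evaluates its matchers. The interface is a local stand-in for net.sf.regain.crawler.config.UrlMatcher, and DemoMatcher, FlagLookup and FlagLookupDemo are hypothetical names.

```java
import java.util.Arrays;
import java.util.List;

// Demo entry point (hypothetical name, not part of Regain).
class FlagLookupDemo {
    public static void main(String[] args) {
        List<UrlMatcher> matchers = Arrays.asList(
            new DemoMatcher("http://example.com/docs/", true, true),
            new DemoMatcher("http://example.com/", true, false));
        // The first matcher accepts this URL, so its flags are reported.
        System.out.println(FlagLookup.describe(matchers, "http://example.com/docs/a.html"));
        // Only the second matcher accepts this URL.
        System.out.println(FlagLookup.describe(matchers, "http://example.com/news.html"));
    }
}

// Local stand-in mirroring the documented method signatures (assumption:
// real code would use the Regain interface instead).
interface UrlMatcher {
    boolean matches(String url);
    boolean getShouldBeParsed();
    boolean getShouldBeIndexed();
    void setShouldBeParsed(boolean shouldBeParsed);
    void setShouldBeIndexed(boolean shouldBeIndexed);
}

// Hypothetical matcher used only for this demo: accepts URLs with a given prefix.
class DemoMatcher implements UrlMatcher {
    private final String prefix;
    private boolean parse;
    private boolean index;

    DemoMatcher(String prefix, boolean parse, boolean index) {
        this.prefix = prefix;
        this.parse = parse;
        this.index = index;
    }

    @Override public boolean matches(String url) { return url != null && url.startsWith(prefix); }
    @Override public boolean getShouldBeParsed() { return parse; }
    @Override public boolean getShouldBeIndexed() { return index; }
    @Override public void setShouldBeParsed(boolean shouldBeParsed) { this.parse = shouldBeParsed; }
    @Override public void setShouldBeIndexed(boolean shouldBeIndexed) { this.index = shouldBeIndexed; }
}

// Hypothetical helper: reports the flags of the first matcher that accepts the URL.
final class FlagLookup {
    private FlagLookup() {}

    static String describe(List<? extends UrlMatcher> matchers, String url) {
        for (UrlMatcher matcher : matchers) {
            if (matcher.matches(url)) {
                return "parse=" + matcher.getShouldBeParsed()
                    + ", index=" + matcher.getShouldBeIndexed();
            }
        }
        return "no match";
    }
}
```

Taking the first accepting matcher is a design choice of this sketch; with it, more specific rules have to be listed before more general ones.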

setShouldBeParsed

void setShouldBeParsed(boolean shouldBeParsed)
Sets the flag indicating whether links should be extracted from the content.


setShouldBeIndexed

void setShouldBeIndexed(boolean shouldBeIndexed)
Sets the flag indicating whether the content should be indexed.


Regain 2.1.0-STABLE, Copyright (C) 2004-2010 Til Schneider, www.murfman.de, Thomas Tesche, www.clustersystems.info