Crawler

A crawler to get all a tags of a page. It only crawls the targets domain. If you crawl example.com which has a link to whatever.com, whatever.com will not be crawled.

Instantiation

$crawl = new Crawler();

Set target to crawl

$crawl->crawl('https://example.com');

Get crawled links

Gets all the crawled links of that domain as a one dimensional array.

$crawl->getCrawledLinks();

Defaults

Scheme allowed

http
https

Extensions allowed

html
htm

Options

You can set and get allowed schemes and file extensions.

Setting allowed file extensions

$crawl->allowed('set', 'allowedFiles', '.pdf', '.png');

Removing allowed file extensions

$crawl->allowed('remove', 'allowedFiles', '.pdf', '.png');

Removing allowed schemes

$crawl->allowed('remove', 'allowedSchemes', 'http');

Getting allowed file extensions

$crawl->allowed('get', 'allowedFiles');

Getting allowed schemes

$crawl->allowed('get', 'allowedSchemes');

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github/ISSUE_TEMPLATE		.github/ISSUE_TEMPLATE
php		php
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Crawler

Instantiation

Set target to crawl

Get crawled links

Defaults

Scheme allowed

Extensions allowed

Options

Setting allowed file extensions

Removing allowed file extensions

Removing allowed schemes

Getting allowed file extensions

Getting allowed schemes

About

Releases

Packages

Languages

License

JustinThiede/crawler

Folders and files

Latest commit

History

Repository files navigation

Crawler

Instantiation

Set target to crawl

Get crawled links

Defaults

Scheme allowed

Extensions allowed

Options

Setting allowed file extensions

Removing allowed file extensions

Removing allowed schemes

Getting allowed file extensions

Getting allowed schemes

About

Topics

Resources

License

Code of conduct

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages