Respect nofollow links #32
Labels
No labels
bug
docs
duplicate
enhancement
good first issue
help wanted
question
type: js
type: ruby
unsure
wontfix
No milestone
No project
No assignees
1 participant
Notifications
Due date
No due date set.
Dependencies
No dependencies set.
Reference
dan/siteinspector#32
Loading…
Add table
Add a link
Reference in a new issue
No description provided.
Delete branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
The tool at present appears to not respect nofollow links. This resulted in a scan against a site enumerating all the facets and pagination in a given search page (to the extent of 122,000 pages).
It would be great if by default nofollow links were respected. There generally is a good reason when people use these.
@seanhamlin adding the 'exclude' xpath selector should solve this issue:

I don't think that making ref=nofollow excluded by default is a good idea since some people might expect those pages being crawled.