aboutsummaryrefslogtreecommitdiff
path: root/testdata
AgeCommit message (Collapse)Author
2025-06-12all: refactoring, move brokenlinks code to its own packageShulhan
When two or more struct has the same prefix that means it is time to move it to group it. Also, we will group one command to one package in the future.
2025-06-12all: rename the json field page_links to broken_linksShulhan
Naming it page_links does not make sense if the result is from brokenlinks command.
2025-06-05all: encode the whole BrokenlinksResult struct to JSONShulhan
Previously, we only encode the BrokenlinksResult.PageLinks. The struct may changes in the future, so its better to encode the whole struct now rather than changing the output later.
2025-06-05all: add option to scan pass resultShulhan
The brokenlinks command now have option "-past-result" that accept path to JSON file from the past result. If its set, the program will only scan the pages with broken links inside that report.
2025-05-30all: cleaning up fragment on linksShulhan
The fragment part on URL, for example "/page#fragment" should be removed, otherwise it will indexed as different URL.
2025-05-29all: add test cases for broken link and invalid URLShulhan
2025-05-29all: ignore HTML page from external domainShulhan
Any HTML link that is from domain other than the scanned domain should net get parsed. It only check if the link is valid or not.
2025-05-29all: parse only link to HTML pageShulhan
For link to image we can skip parsing it.
2025-05-29testdata/web: remove anchor to external websiteShulhan
The test should not require internet connection to be passed.
2025-05-29all: add case for broken HTMLShulhan
Turn out broken HTML still get parsed by "net/html" package.
2025-05-27all: complete the first minimum working implementationShulhan
The current implementation at least cover 84% of the cases. Todo, * CLI for scan * add more test case for 100% coverage, including scan on invalid base URL, scan on invalid HTML page, scan on invalid href or src image
2025-05-22deadlinks: a program to scan for dead links on websiteShulhan