Get all URLs detected inside a Feed/Atom/RSS/Sitemap xml markup.
$ npm install xml-urls --save
const xmlUrls = require('xml-urls')
;(async () => {
const url = process.argv[2]
if (!url) throw new TypeError('Need to provide an url as first argument.')
const urls = await xmlUrls(url)
urls.forEach(url => console.log(url))
// => [
// 'http://www.sitemappro.com/',
// 'http://www.sitemappro.com/download.html',
// 'http://www.sitemappro.com/register.html',
// 'http://www.sitemappro.com/examples.html',
// 'http://www.sitemappro.com/company.html',
// 'http://www.sitemappro.com/contact.html',
// ...
// ]
})()
See more at examples.
Required
Type: string
Type: object
Use it for providing html-get#options.
Type: array
Default: []
A list of links to be excluded from the final output. It supports regex patterns.
See [matcher](https://github.com/sindresorhus/matcher#matcher-= for know more.
- html-urls – Get all urls from a HTML markup.
- css-urls – Get all URLs referenced from stylesheet files.
xml-urls © Kiko Beats, released under the MIT License.
Authored and maintained by Kiko Beats with help from contributors.
kikobeats.com · GitHub @Kiko Beats · X @Kikobeats