๐ Extract URLs
Paste any block of text and instantly extract every URL and link found. One URL per line, with an option to remove duplicates. Free, private, no signup.
About the URL Extractor
This tool scans any block of text for URLs starting with http:// or https:// and lists them one per line. It works on plain text, HTML source, Markdown, emails, documents and any other text that contains links. Everything runs in your browser with no data sent to a server.
Strip Trailing Punctuation
In natural language text, URLs are sometimes followed by a period, comma or closing parenthesis โ for example "Visit https://example.com.". Enable Strip Trailing Punctuation to automatically clean these characters off the end of each extracted URL.
Common Use Cases
- Extracting all links from a web page's HTML source code
- Pulling URLs out of a long document or email thread
- Building a link list from scraped or exported content
- Auditing all outbound links in a block of marketing copy
- Collecting citation URLs from an academic text
- Deduplicating repeated links found in newsletters
Frequently Asked Questions
What types of URLs does this extract?
+The tool extracts any URL beginning with http:// or https://. This includes URLs with paths, query strings, fragments and subdomains. Very long URLs with special characters and encoded characters are handled correctly.
Can I extract URLs from HTML source?
+Yes. The extractor finds URLs anywhere in the text including inside HTML attributes like href, src and action. Paste raw HTML source and all absolute URLs will be extracted. Relative URLs like /page or ../image.png are not matched since they lack the http/https prefix.
Why are some URLs cut off?
+In prose text, URLs are sometimes followed immediately by punctuation like a full stop or comma. Enable Strip Trailing Punctuation to remove these from extracted URLs. If you are extracting from HTML or code, this option is usually not needed.
Is the extraction private?
+Completely private. All processing happens locally in your browser. Your text is never sent to any server, stored or logged. You can safely process confidential documents, internal pages or proprietary content.