← Back to Blog

Deep Dive: URL Captures

The complete metadata taxonomy—11 categories, 115+ fields extracted from web pages.

By Rouslan Zenetl

When you share a URL, Recall fetches the page and extracts structured metadata into 11 categories.

1. Content Classification

Identifies what kind of content this is and how the source categorizes it.

FieldSwift TypeDescription
typeString?Content type: article, video, product, etc.
categoryString?Topic category from source
sectionString?Website section: Opinion, News, etc.
descriptionString?Summary or excerpt
keywordsString?Comma-separated topic tags
languageString?ISO 639-1 language code
localeString?Full locale identifier (en_US)
localesString?Available locales, comma-separated
canonicalUrlString?Canonical URL for the content
licenseString?Content license

2. Authorship

Who created and published the content.

FieldSwift TypeDescription
authorString?Primary author name
authorsString?All authors, comma-separated
contributorsString?Contributors, comma-separated
publisherString?Publisher organization
organizationString?Organization name
copyrightString?Copyright statement
editorString?Editor name

3. Temporal

When content was created, updated, and for media, how long it runs.

FieldSwift TypeDescription
publishedDateDate?Publication timestamp
modifiedDateDate?Last modification timestamp
expirationDateDate?Content expiration date
createdDateDate?Creation timestamp
durationString?Media duration (ISO 8601: PT15M30S)

4. Media Assets

Visual and audio content. Thumbnails are automatically downloaded and cached.

FieldSwift TypeDescription
thumbnailUrlString?Preview image URL
imageUrlString?Primary image URL
imageWidthString?Image width in pixels
imageHeightString?Image height in pixels
videoUrlString?Video URL
audioUrlString?Audio URL

5. Ratings & Reviews

User ratings for products and reviewed content.

FieldSwift TypeDescription
ratingValueString?Rating value (e.g., “4.5”)
ratingBestString?Maximum rating (e.g., “5”)
ratingCountString?Number of ratings
reviewCountString?Number of reviews

6. Commercial

E-commerce data. Captures point-in-time price and availability.

FieldSwift TypeDescription
priceString?Product price
priceCurrencyString?ISO 4217 currency code
availabilityString?Stock status
brandString?Product brand
skuString?Stock keeping unit
conditionString?new, used, refurbished

7. Identifiers

Academic and publishing identifiers for citation.

FieldSwift TypeDescription
isbnString?International Standard Book Number
issnString?International Standard Serial Number
doiString?Digital Object Identifier
issueNumberString?Issue or volume number

8. Geographic

Location data for events, venues, and place-based content.

FieldSwift TypeDescription
locationNameString?Venue or place name
streetAddressString?Street address
cityString?City name
regionString?State/province/region
countryString?Country name
postalCodeString?ZIP/postal code

9. Relationships

Series and episodic content positioning.

FieldSwift TypeDescription
seriesNameString?Series or collection name
seasonNumberString?Season number
episodeNumberString?Episode number
volumeNumberString?Volume number
partOfSeriesString?Series URL or identifier

10. Social

Author and publisher social media presence.

FieldSwift TypeDescription
twitterUsernameString?Twitter/X handle
twitterSiteIdString?Twitter site ID
facebookAppIdString?Facebook app ID
instagramUsernameString?Instagram handle
linkedinProfileString?LinkedIn profile URL

11. Site-Specific

Platform-specific fields for major sites.

YouTube

FieldSwift TypeDescription
videoIdString?YouTube video ID
channelString?Channel name
channelUrlString?Channel URL
durationString?Video duration
viewCountString?View count

GitHub

FieldSwift TypeDescription
repositoryString?Repository name
ownerString?Repository owner
starsString?Star count
forksString?Fork count
languageString?Primary language
topicsString?Topics, comma-separated

Twitter/X

FieldSwift TypeDescription
tweetIdString?Tweet ID
usernameString?Username
retweetsString?Retweet count
likesString?Like count
timestampDate?Tweet timestamp

Amazon

FieldSwift TypeDescription
asinString?Amazon Standard ID
productGroupString?Product category
featuresString?Key features

Medium

FieldSwift TypeDescription
publicationString?Publication name
readTimeString?Estimated read time
clapCountString?Clap count

ArXiv

FieldSwift TypeDescription
arxivIdString?ArXiv identifier
categoriesString?Subject categories
pdfUrlString?PDF download URL

Product Hunt

FieldSwift TypeDescription
productIdString?Product identifier
votesCountString?Upvote count
commentsCountString?Comment count

All fields optional—populated only when source HTML contains relevant data. Extracted from Open Graph, Twitter Cards, Schema.org, Dublin Core, and platform-specific meta tags.

References

  • Open Graph Protocol — Facebook’s metadata standard for rich link previews
  • Twitter Cards — X/Twitter’s markup for enhanced tweets
  • Schema.org — Structured data vocabulary for search engines
  • Dublin Core — Metadata standard for digital resources
  • JSON-LD — Linked data format often used with Schema.org