/og-tag-scrapper

A service to scrape an input URL and parse its OG metadata. If the page has OG parameters set exclusively, then the API will return all the OG parameters.

Primary LanguageJavaScript

og-tag-scrapper

og tag scrapper is an api to scrap og tags / meta tags and return title, description, images in the api

Description

Og Tag Scrapper exposes an api that will take a site url and return the meta tags values of title, description, and images used in the site. It will first try to find the Open Graph (og) meta tags on the site. If there any above of og tags not found it will fallback to site meta tags.

POST  / 
payload {"url": ""}

Sample CURL to use the API

curl --location --request POST 'https://hl82j05f4m.execute-api.us-east-1.amazonaws.com/dev/api/v1/meta-tag' \
--header 'Authorization: *********' \
--header 'Content-Type: application/json' \
--data-raw '{
    "url": "https://www.npmjs.com/package/dotenv"
}'

Note: This api has basic auth enabled. Without authentication, it will throw 401 error.

  • To use the api need to add valid basic auth credentials in the place of *********
  • In the api the url can be sent in the url attribute of request body's to scrap and retrieve meta tags

  • Basic Auth cred is added as environment variable for basic security on API layer.
  • and meta tags are added in environment variable too to update meta params dynamically if needed in future.

Release Notes:

  • Currently this is deployed in dev env.
  • Caching layer like Elasticache / DAX can be added for performance improvement ( Future scope)