Mikhail Sisin Co-founder of cloud-based web scraping and data extraction platform Diggernaut. Over 10 years of experience in data extraction, ETL, AI, and ML.

Extracting price and product information from Calypso St Barth online store

8 min read

Extracting price and product information from Calypso St Barth online store

Since the launch of its first store in 1992, Calypso St. Barth grew out of a modest boutique of resort clothes to a luxurious, recognizable worldwide brand. Designers, inspired by the travels, culture and art of different countries, create amazing and feminine models that women of all ages like. If you like a mixture of light elegance and exotic ideas, you will definitely enjoy the assortment of this online store. Our free scraper can be used for extracting price and product information from Calypso St Barth online store. Also it let you extract product images.

Approx number of goods: 2000
Approx number of page requests: 2000
Recommended subscription plan: Free

PLEASE NOTE! The number of requests can exceed the number of products, because data about variations, images, etc. can be scraped from other resources and will require additional requests. Also part of the product data can be delivered using XHR requests, which also increases the total number of required page requests.

How to use the web scraper to extract data about goods and prices from calypsostbarth.com

To use the web scraper for Calypso St Barth store’s website, you must have an account with our Diggernaut service. You can just simply follow this comprehensive guide:

  1. Go through this registration link to open free account with Diggernaut
  2. After registering and confirming the email address, you will need to log in to your account
  3. Create a project with any name and description, if you do not know how to do it, please refer to our documentation
  4. Switch to the created project and create a digger with any name, if you do not know how to do it, please refer to our documentation
  5. Copy the following digger configuration to the clipboard and paste it into the digger you created, if you do not know how to do it, refer to our documentation
  6. Switch the mode of the digger from Debug to Active, if you do not know how to do it, please refer to our documentation
  7. Run your digger and wait until the completion, if you do not know how to do it, please refer to our documentation
  8. Download the scraped dataset in the format you need, if you do not know how to do it, please refer to our documentation

You can also setup a schedule for running your scraper and collect data regularly.

Scraping configuration for the digger

---
config:
    debug: 2
    agent: Firefox
do:
- walk:
    to: http://www.calypsostbarth.com/new-arrivals/clothing
    do:
    - find: 
        path: .col-side > ul > li 
        do: 
        - variable_clear: cat1
        - variable_clear: cat2
        - variable_clear: cat3
        - find: 
            path: a
            slice: 0
            do: 
            - parse
            - space_dedupe
            - trim
            - normalize:
                routine: lower
            - variable_set: cat1
        - parse:
            format: html
        - register_set: 
  • <%register%>
- to_block - find: path: div > ul > li > ul > li do: - variable_clear: cat2 - variable_clear: cat3 - find: path: a slice: 0 do: - parse - space_dedupe - trim - normalize: routine: lower - variable_set: cat2 - parse: attr: href - normalize: routine: url - pool_clear: default - parse: attr: href - link_add - walk: to: links do: - find: path: .paging li > a:matches([Nn]ext) do: - parse: attr: href - link_add - find: path: .item > a do: - parse: attr: href - walk: to: value do: - object_new: product - find: in: doc path: head do: - eval: routine: js body: '(function (){var d = new Date(); return d.toISOString()})();' - object_field_set: object: product field: date - static_get: url - object_field_set: object: product field: url - find: path: .tabbed li:nth-of-type(1) do: - parse - space_dedupe - trim - if: match: (\S) do: - object_field_set: object: product field: description - find: path: body do: - register_set: Calypso St. Barth - object_field_set: object: product field: brand - variable_get: cat1 - if: match: (\S) do: - object_field_set: object: product field: categories joinby: "|" - variable_get: cat2 - if: match: (\S) do: - object_field_set: object: product field: categories joinby: "|" - variable_get: cat3 - if: match: (\S) do: - object_field_set: object: product field: categories joinby: "|" - find: path: .product-name > h1 do: - parse - space_dedupe - trim - object_field_set: object: product field: name - find: path: img.cloudzoom-gallery, .image > img do: - parse: attr: data-cloudzoom - eval: routine: js body: (function () {var e = {<%register%>}; return JSON.stringify(e);})(); - normalize: routine: json2xml - to_block - find: path: zoomImage do: - parse - normalize: routine: url - object_field_set: object: product field: images joinby: "|" - find: path: script:matches(var\s*spConfig) do: - parse: filter: - Product\.Config\((.+)\); - normalize: routine: json2xml - to_block - find: path: 'attributes > *:hasChild(code:matches(color)) > options > label' do: - parse - space_dedupe - trim - object_field_set: object: product field: variations joinby: "|" - find: path: body_safe > productid do: - parse - space_dedupe - trim - object_field_set: object: product field: sku - find: path: body_safe > baseprice do: - parse - if: match: (\d) do: - object_field_set: object: product field: price type: float - register_set: USD - object_field_set: object: product field: currency - object_save: name: product - find: path: ul > li > a do: - parse - space_dedupe - trim - normalize: routine: lower - variable_set: cat3 - parse: attr: href - normalize: routine: url - pool_clear: default - parse: attr: href - link_add - walk: to: links do: - find: path: .paging li > a:matches([Nn]ext) do: - parse: attr: href - link_add - find: path: .item > a do: - parse: attr: href - walk: to: value do: - object_new: product - find: in: doc path: head do: - eval: routine: js body: '(function (){var d = new Date(); return d.toISOString()})();' - object_field_set: object: product field: date - static_get: url - object_field_set: object: product field: url - find: path: .tabbed li:nth-of-type(1) do: - parse - space_dedupe - trim - if: match: (\S) do: - object_field_set: object: product field: description - find: path: body do: - register_set: Calypso St. Barth - object_field_set: object: product field: brand - variable_get: cat1 - if: match: (\S) do: - object_field_set: object: product field: categories joinby: "|" - variable_get: cat2 - if: match: (\S) do: - object_field_set: object: product field: categories joinby: "|" - variable_get: cat3 - if: match: (\S) do: - object_field_set: object: product field: categories joinby: "|" - find: path: .product-name > h1 do: - parse - space_dedupe - trim - object_field_set: object: product field: name - find: path: img.cloudzoom-gallery, .image > img do: - parse: attr: data-cloudzoom - eval: routine: js body: (function () {var e = {<%register%>}; return JSON.stringify(e);})(); - normalize: routine: json2xml - to_block - find: path: zoomImage do: - parse - normalize: routine: url - object_field_set: object: product field: images joinby: "|" - find: path: script:matches(var\s*spConfig) do: - parse: filter: - Product\.Config\((.+)\); - normalize: routine: json2xml - to_block - find: path: 'attributes > *:hasChild(code:matches(color)) > options > label' do: - parse - space_dedupe - trim - object_field_set: object: product field: variations joinby: "|" - find: path: body_safe > productid do: - parse - space_dedupe - trim - object_field_set: object: product field: sku - find: path: body_safe > baseprice do: - parse - if: match: (\d) do: - object_field_set: object: product field: price type: float - register_set: USD - object_field_set: object: product field: currency - object_save: name: product

Sample of scraped data

Below is a sample of a dataset with several products in JSON format (so you can easily review it and see data structure). The dataset can be downloaded as CSV, XLSX, XML, or any other text format using the templates.

[{
    "product": {
        "brand": "Calypso St. Barth",
        "categories": "new arrivals|clothing",
        "currency": "USD",
        "date": "2017-12-27T09:48:47.804Z",
        "description": "By CALYPSO St. Barth - Welcome cooler weather with this lavish suede jacket. Supple soft and chic, this jacket flatters with a feminine fit and lengthy silhouette. Accented with a ruffled trim, this autumn favorite features hook and eye closure and welt pockets. Wear with anything from casual basics to night out ensembles, a special wardrobe addition!",
        "images": "http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/d/i/diosajacket4_1.1502486097.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/d/i/diosajacket3_1.1502486097.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/d/i/diosajacket2_1.1502486098.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/d/i/diosajacket5_1.1502486098.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/d/i/diosajacket_1.1502486099.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/d/i/diosajacket.1502486099.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/d/i/diosajacket4_1.1502486097.jpg",
        "name": "Diosa Suede Ruffle Jacket",
        "price": 1100,
        "sku": "86060",
        "url": "http://www.calypsostbarth.com/new-arrivals/clothing/diosa-suede-ruffle-jacket",
        "variations": "ATMN GLD"
    }
}
,{
    "product": {
        "brand": "Calypso St. Barth",
        "categories": "new arrivals|clothing",
        "currency": "USD",
        "date": "2017-12-27T09:48:48.837Z",
        "description": "By CALYPSO St. Barth - Transcend into a new dimension of the open cardigan and welcome a versatile piece, essential to both dressy & casual outfitting. Composed of cozy alpaca wool, this sweater stands in for a lavish cold weather scarf, poncho or transitional jacket. Accented with ribbed detailing at the opening flaps, the cardigan’s lengthy back center seam connects to inside out seam at waist for figure enhancing structure amidst an oversized silhouette. Wear it over dresses, pants or loungewear – it’s the perfect piece to carry you into the colder seasons.",
        "images": "http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artatlaskosencocoon_20.1504818057.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artatlaskosencocoon_35.1504818057.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artatlaskosencocoon_12_ppn_.1504818058.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artatlaskosencocoon_28_ppn_.1504818059.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/s/e/serpinadress_11_1.1504818059.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artatlaskosencocoon_20.1504818057.jpg",
        "name": "Kosen Wool Sweater Cocoon",
        "price": 395,
        "sku": "86562",
        "url": "http://www.calypsostbarth.com/new-arrivals/clothing/kosen-wool-sweater-cocoon",
        "variations": "LT GREY"
    }
}
,{
    "product": {
        "brand": "Calypso St. Barth",
        "categories": "new arrivals|clothing",
        "currency": "USD",
        "date": "2017-12-27T09:48:49.120Z",
        "description": "By CALYPSO St. Barth - Casual dressing made easy, this lightweight cotton dress is comfortable, casual and oh so chic. A relaxed silhouette, this effortless mini is graced with ruffled flounce sleeves. Back yoke and square neckline are accented with shirring details that flow into a loose body. Finished with side seam pockets, this dress will your new “go-to” favorite this season.",
        "images": "http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artemiadress4.1503599057.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artemiadress2.1503599058.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artemiadress3.1503599058.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artemiadressmain.1503599059.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artemiadress.1503599060.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/e/m/emptyname_13.1503599060.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/a/r/artemiadress4.1503599057.jpg",
        "name": "Artemia Flounce Sleeve Dress",
        "price": 225,
        "sku": "86442",
        "url": "http://www.calypsostbarth.com/new-arrivals/clothing/artemia-flounce-sleeve-dress",
        "variations": "Black"
    }
}
,{
    "product": {
        "brand": "Calypso St. Barth",
        "categories": "new arrivals|clothing",
        "currency": "USD",
        "date": "2017-12-27T09:48:49.430Z",
        "description": "By CALYPSO St. Barth - Boho chic meets outdoorsy elegance in this fleece poncho. Adorned with a tribal motif in earthy grey hues, it is finished with olive green crochet detailing along the lower trim, front zip and pockets as well as the shoulder sleeves. Gold zippers are enhanced with taupe leather trim for an added touch of bohemian charm. Luxuriously soft and cozy, this is the perfect outerwear piece to take you into the fall season",
        "images": "http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/b/a/bahamutponcho_26.1504799605.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/b/a/bahamutponcho_46.1504799605.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/b/a/bahamutponcho_85.1504799606.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/b/a/bahamutponcho_79.1504799607.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/1/1/11_2_1640127.1504799608.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/w/i/winterlooks39684.1504799608.jpg|http://www.calypsostbarth.com/media/catalog/product/cache/1/image/975x/040ec09b1e35df139433887a97daa66f/b/a/bahamutponcho_26.1504799605.jpg",
        "name": "Bahamut Fleece Pullover Poncho",
        "price": 395,
        "sku": "86563",
        "url": "http://www.calypsostbarth.com/new-arrivals/clothing/bahamut-fleece-pullover-poncho-86639",
        "variations": "Grey CC"
    }
}]
Mikhail Sisin Co-founder of cloud-based web scraping and data extraction platform Diggernaut. Over 10 years of experience in data extraction, ETL, AI, and ML.

Leave a Reply

Your email address will not be published. Required fields are marked *