All Stories

What is OCR? – Diggernaut can help!

Optical character recognition (OCR) is a technology that accurately recognizes printed and written text characters by a computer. It might involve photo scanning of...
David Kostyshock
1 min read

Json to XML, or “transform in 6 seconds.”

Hi folks. I want to share with you some details about our engine. As you know, it is written in Go. We use a...
Evgeniy Solomanidin
1 min read

Using JSON schema to validate your data

Recently we added a couple of neat functions which let you work with data more efficiently. So one of these functions is JSON schema...
Mikhail Sisin
3 min read

Extracting data from El Paso County Sheriff Blotter

This article can be used for educational purposes if you want to learn how to work with our Excavator application. We assume that you...
Mikhail Sisin
4 min read

What is most efficient language for web scraping…

We decided to do this small test to find out what is more efficient (speed, CPU and RAM usage wise) programming language for web...
Mikhail Sisin
59 sec read

Diggernaut can run JS routines

How to execute JavaScript snippet in the middle of the scraping process? Sometimes, when you scrape something, you may need to calculate some parameter...
Evgeniy Solomanidin
1 min read

I want to have my data flow directly…

Do you want to save data to RDBMS? It’s not that hard. So today I’m going to show you how easy it can be...
Evgeniy Solomanidin
1 min read

Self-hosted scraping? No problem!

Do you want to scrape some data from the website, but you are not ready to deploy your web scraper to the cloud and...
Mikhail Sisin
2 min read

How journalists can save time with data extraction.

Journalists have a difficult job. They have to do much research, and that takes time. Facts and figures are vital to a story or...
David Kostyshock
1 min read

What to do when server respond with JSON?

Diggernaut makes it easy to work with JSON format by converting it to XML. So now I’ll show you how you can work with...
Evgeniy Solomanidin
3 min read