Currently browsing: Articles

January 28, 2024 admin Case Studies Comments Off

The One Billion Row Challenge: Optimizing CSV Performance

A few weeks ago, Gunnar Morling announced the One Billion Row Challenge, described as follows: “write a Java program for […]

November 20, 2023 admin Articles Comments Off

Pansynchro is designed from the ground up to be as fast as possible. Now, anyone can say “our product is […]

November 17, 2023 admin Articles Comments Off

A lot of words have been spilled various corners of the data engineering world regarding whether ETL or ELT is […]

November 17, 2023 admin Getting Started Comments Off

So you’ve got a network sync job set up. But simply copying the data isn’t enough; you want to run […]

November 16, 2023 admin Getting Started Comments Off

In the first part, we built a very simple, local data sync script. But most data sync work isn’t running […]

November 16, 2023 admin Getting Started Comments Off

A Pansynchro data pipeline has three major components: a reader that ingests the data from the source, a writer that […]

November 16, 2023 admin Articles Comments Off

We speak of ETL and ELT, that data should be (E)xtracted from a source and (L)oaded into a destination, and […]

November 16, 2023 admin Articles Comments Off

One of the simplest possible ETL tasks is to clone a database. Extract and load, with no transforms, from a […]