A few weeks ago, Gunnar Morling announced the One Billion Row Challenge, described as follows: “write a Java program for […]
A few weeks ago, Gunnar Morling announced the One Billion Row Challenge, described as follows: “write a Java program for […]
Pansynchro is designed from the ground up to be as fast as possible. Now, anyone can say “our product is […]
A lot of words have been spilled various corners of the data engineering world regarding whether ETL or ELT is […]
So you’ve got a network sync job set up. But simply copying the data isn’t enough; you want to run […]
In the first part, we built a very simple, local data sync script. But most data sync work isn’t running […]
A Pansynchro data pipeline has three major components: a reader that ingests the data from the source, a writer that […]
We speak of ETL and ELT, that data should be (E)xtracted from a source and (L)oaded into a destination, and […]
One of the simplest possible ETL tasks is to clone a database. Extract and load, with no transforms, from a […]