What is a Data Engineer?
| aws | | redshift | | bigdata | | dataengineer |
Read MoreHow to fuzzy match in Redshift
| aws | | redshift | | bigdata |
Fuzzy Merging — Photo by Markus Spiske on Unsplash
How to Ingest DynamoDB JSON Data into Redshift
| dynamodb | | redshift | | bigdata |
Photo by Rodion Kutsaev on Unsplash
Providing Valuable Data to a Business as a Data Engineer
| dataengineer | | datawarehouse | | bigdata |
It can be easy to overlook the complexities of being a data engineer. After all we’re just moving data from one place to another right: how hard can it be?
How to Build a Fast Data Warehouse with Amazon Redshift
| redshift | | aws | | datawarehouse | | bigdata |
Data Warehousing: Simplified
Data Science 101
| datascience | | machinelearning | | bigdata |
NoSQL vs SQL | Which one should I choose?
| bigdata | | nosql | | database |
Intro to Apache HBase
| bigdata | | hadoop | | hbase |
How to Scale Spark Streaming Applications
| bigdata | | streaming | | spark |
Twitter EU Referendum Analysis
| python | | datascience | | bigdata |
Building a search engine with Elasticsearch
| elasticsearch | | bigdata | | searching |
Spark Performance Tuning | 5 ways to improve performance of Spark Applications
| bigdata | | spark | | performance |
What is Apache Kudu in 5 Minutes
| bigdata | | storage | | kudu |
Why use Apache Kafka for real time streaming applications
| bigdata | | streaming | | kafka |
A Day at Strata and Hadoop World
| bigdata | | hadoop | | conference |
Apache Flume
| ingestion | | flume | | bigdata |
In this post I will be discussing some simple use cases and important features of Apache Flume.
Apache Sqoop Advanced
| bigdata | | sqoop | | ingestion |
Here we are going to look at using Sqoop to do a little more than simply import or export data, including ensuring data imported is in a suitable format for use with Hive.