🧪 Data Lab
Learning • Experimentation • Data Engineering Projects
> [!NOTE]
> Code, datasets, and designs may change as I refine implementations and adopt best practices.
---
Table of Contents
- About
- Projects
- Tools & Technologies
- Contact Me
---
## About
**Data Lab** is a personal sandbox for building and iterating on **data engineering projects**.
The focus is on:
- Designing reliable data pipelines
- Applying analytics engineering principles
- Working with batch data, lakehouse patterns, and validation
- Learning by building realistic systems rather than just toys
Projects range from small experiments to end-to-end pipelines using production-style tools.
(back to top)
---
# Projects
*Each project will be in its own Repo.*
## [Banking Transaction Pipeline](https://git.camcodes.dev/Cameron/Banking_Data_Pipeline)
- Spark-based ETL pipeline
- Bronze → Silver → Gold lakehouse design
- Data quality enforcement and validation
(back to top)
---
## Tools & Technologies
- **Languages:** Python, SQL, Java
- **Processing:** Apache Spark (PySpark)
- **Storage:** S3 storage
- **Data Formats:** Parquet, Delta-style layouts
- **Databases:** PostgreSQL, SQLite (project-dependent)
- **Visualization:** Tableau / Power BI
*Tools may expand as new projects are added.*
---
## 💬 Connect With Me
[](https://www.linkedin.com/in/cameron-css/) [](https://CamDoesData.com) [](https://www.kaggle.com/cameronseamons) [](mailto:CameronSeamons@gmail.com) [](https://drive.google.com/file/d/1YaM4hDtt2-79ShBVTN06Y3BU79LvFw6J/view?usp=sharing)
(back to top)