Camila Javiera Muñoz Navarro

Camila Javiera Muñoz Navarro

@CamilaJaviera91

Hey!! I'm from Chile and I'm trying to learn the wonders of data engineering

Chile
306
Followers
338
Following
26
Public Repos
0
Private Repos

Language Breakdown

Lines of code distribution across 26 owned repositories

381K Total LOC
Python
357,628 lines
93.8%
N/A
HTML
13,548 lines
3.6%
N/A
Shell
4,112 lines
1.1%
N/A
CSS
3,659 lines
1.0%
N/A
R
1,846 lines
0.5%
N/A
Other
275 lines
0.1%
N/A
I

I-Shaped Developer

I-shaped

Specialist — deep expertise in Python

Python
HTML
Shell
CSS
R

Collaboration Network

Global Impact visualization

LIVE
Camila Javiera Muñoz Navarro
0 active collaborators

Repos

26

PRs

0

Growth

+18%

Top Collaborators

No collaborator data yet.

Coding Streak

Contribution activity over the past year

211 days
1,996
Contributions
1,944
Commits
41
Pull Requests
Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun
Mo
We
Fr
Based on GitHub activity
Less
More

Top Repositories

mock-data-factory

Generate large-scale synthetic datasets using SQL and BigQuery.

3 0
Python
pyspark-first-approach

This code demonstrates how to integrate PySpark with datasets and perform simple data transformations. It loads a sample dataset using PySpark's built-in functionalities or reads data from external sources and converts it into a PySpark DataFrame for distributed processing and manipulation.

3 0
Python
Sales-Prediction-Using-PostgreSQL

This project is designed to extract sales data from a PostgreSQL database, process it, and use a Random Forest model to predict sales quantities. It also visualizes real and predicted sales for better understanding.

3 0
Python
bagging-with-kaggle

Code in which an initial approach to decision trees and bagging will be made, and an attempt will be made to ensure that the model can be trained with any dataset coming from Kaggle (for this, we will again use the 'connect with Kaggle' project).

3 0
Python
sql-to-googlesheets

This repository provides a set of scripts to extract data from a MySQL database, transform it into a CSV file, and integrate it with Google Sheets. The workflow includes database connection, querying, data transformation, and file generation.

3 1
Python
clean-data-with-googlesheets

This project includes two main scripts: `cvs_to_sheets.py` and `google_sheets_utils.py`. These scripts allow data processing from Google Sheets, performing data cleaning and analysis, and generating charts in a PDF file. Additionally, the processed results can be saved back to Google Sheets.

3 0
Python
search-dataset-from-kaggle

With this code you can search and download any data from kaggle

3 1
Python
String-Distance-Metrics

Este repositorio contiene una colección de métodos y algoritmos de métricas de distancia y similitud diseñados para cuantificar el grado de parecido entre dos campos de texto (cadenas) provenientes de dos archivos de datos diferentes.

2 0
Python
mini-gcp

This project simulates a modern data pipeline architecture, entirely locally. It follows a modular design to extract, transform, load, validate, and analyze synthetic sales data using Python, Apache Beam, DuckDB, and PostgreSQL.

2 0
Python
gcp-new

This project defines a modern data pipeline architecture using Airflow, DBT, and PostgreSQL. Below you'll find instructions on how to get started and how the repository is structured.

2 0
Python

Open Source Impact

Contributions to external projects

364 merged PRs

No external contributions found.