Camila Javiera Muñoz Navarro

@CamilaJaviera91

Hey!! I'm from Chile and I'm trying to learn the wonders of data engineering

Chile

306

Followers

338

Following

Public Repos

Private Repos

Language Breakdown

Lines of code distribution across 26 owned repositories

381K Total LOC

Python

357,628 lines

93.8%

N/A

HTML

13,548 lines

3.6%

N/A

Shell

4,112 lines

1.1%

N/A

CSS

3,659 lines

1.0%

N/A

1,846 lines

0.5%

N/A

Other

275 lines

0.1%

N/A

I-Shaped Developer

I-shaped

Specialist — deep expertise in Python

Python

HTML

Shell

CSS

Collaboration Network

Global Impact visualization

LIVE

0 active collaborators

Repos

PRs

Growth

+18%

Top Collaborators

No collaborator data yet.

Coding Streak

Contribution activity over the past year

211 days

1,996

Contributions

1,944

Commits

Pull Requests

Jun Jul Aug Sep Oct Nov Dec Jan Feb Mar Apr May Jun

Based on GitHub activity

Less

Followers 306

Blandskron

@Blandskron

Kerry

@kerryjanes

Devy

@devycyan

Anna

@annaveth

Cane

@canesterin

View All

Following

338 total

Mariyam Siddiqui

@MariyamSiddiqui

Hossein Hezami

@hosseinhezami

TEJANAIK

@TejaNaik15

Jakub Frieske

@jakub-frieske

Dr. Partha Majumder

@DrParthaMajumder

View All Network

Synced via GitHub

Top Repositories

mock-data-factory

Generate large-scale synthetic datasets using SQL and BigQuery.

3 0

Python

pyspark-first-approach

This code demonstrates how to integrate PySpark with datasets and perform simple data transformations. It loads a sample dataset using PySpark's built-in functionalities or reads data from external sources and converts it into a PySpark DataFrame for distributed processing and manipulation.

3 0

Python

Sales-Prediction-Using-PostgreSQL

This project is designed to extract sales data from a PostgreSQL database, process it, and use a Random Forest model to predict sales quantities. It also visualizes real and predicted sales for better understanding.

3 0

Python

bagging-with-kaggle

Code in which an initial approach to decision trees and bagging will be made, and an attempt will be made to ensure that the model can be trained with any dataset coming from Kaggle (for this, we will again use the 'connect with Kaggle' project).

3 0

Python

sql-to-googlesheets

This repository provides a set of scripts to extract data from a MySQL database, transform it into a CSV file, and integrate it with Google Sheets. The workflow includes database connection, querying, data transformation, and file generation.

3 1

Python

clean-data-with-googlesheets

This project includes two main scripts: `cvs_to_sheets.py` and `google_sheets_utils.py`. These scripts allow data processing from Google Sheets, performing data cleaning and analysis, and generating charts in a PDF file. Additionally, the processed results can be saved back to Google Sheets.

3 0

Python

search-dataset-from-kaggle

With this code you can search and download any data from kaggle

3 1

Python

String-Distance-Metrics

Este repositorio contiene una colección de métodos y algoritmos de métricas de distancia y similitud diseñados para cuantificar el grado de parecido entre dos campos de texto (cadenas) provenientes de dos archivos de datos diferentes.

2 0

Python

mini-gcp

This project simulates a modern data pipeline architecture, entirely locally. It follows a modular design to extract, transform, load, validate, and analyze synthetic sales data using Python, Apache Beam, DuckDB, and PostgreSQL.

2 0

Python

gcp-new

This project defines a modern data pipeline architecture using Airflow, DBT, and PostgreSQL. Below you'll find instructions on how to get started and how the repository is structured.

2 0

Python

Open Source Impact

Contributions to external projects

364 merged PRs

No external contributions found.