This repository is a small project consisting of an ETL pipeline built with Spark (Scala) and a public API:
- Request the following endpoint to download a GZIP file with the daily weather forecast for Mexico by municipality: https://smn.conagua.gob.mx/tools/GUI/webservices/?method=1
- Convert the GZIP into a JSON file
- Read the data with Spark and write it to a Parquet file (see the sketch below)
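For orientation, here is a minimal sketch of those three steps in Scala. The object name `WeatherEtlSketch`, the output file names, and the Spark settings are illustrative assumptions, not the project's actual code (which lives in `etl.Main`).

```scala
import java.io.{BufferedInputStream, FileOutputStream}
import java.net.URL
import java.util.zip.GZIPInputStream

import org.apache.spark.sql.SparkSession

// Minimal sketch of the pipeline; names and paths are illustrative, not the repo's real code.
object WeatherEtlSketch {
  def main(args: Array[String]): Unit = {
    val endpoint = "https://smn.conagua.gob.mx/tools/GUI/webservices/?method=1"

    // 1. Download the GZIP response and decompress it into a local JSON file.
    val in  = new GZIPInputStream(new BufferedInputStream(new URL(endpoint).openStream()))
    val out = new FileOutputStream("forecast.json")
    try {
      val buffer = new Array[Byte](8192)
      Iterator.continually(in.read(buffer)).takeWhile(_ != -1).foreach(n => out.write(buffer, 0, n))
    } finally { in.close(); out.close() }

    // 2. Read the JSON with Spark and write it out as Parquet.
    val spark = SparkSession.builder().appName("etl-conagua").master("local[*]").getOrCreate()
    spark.read.json("forecast.json").write.mode("overwrite").parquet("forecast.parquet")
    spark.stop()
  }
}
```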
It is pretty simple; you just need to check that sbt and Scala are properly installed.
To install dependencies:
sbt compile
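If you need a point of reference for the build definition, a minimal `build.sbt` along these lines is enough for `sbt compile` to pull Spark in; the project name, Scala version, and Spark version shown here are illustrative assumptions, not the repo's actual values.

```scala
// Minimal build.sbt sketch; versions and project name are assumptions, not the repo's actual values.
name := "etl-conagua-scala"
scalaVersion := "2.12.18"

libraryDependencies += "org.apache.spark" %% "spark-sql" % "3.5.1"
```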
If everything compiled successfully, run the app locally on your machine with:
sbt "runMain etl.Main"
To run the tests:
sbt test
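As a point of reference, a unit test in this setup could look like the sketch below; it assumes ScalaTest is on the test classpath, and both the spec name and the behaviour it checks are illustrative, not the project's actual tests.

```scala
import java.io.{ByteArrayInputStream, ByteArrayOutputStream}
import java.util.zip.{GZIPInputStream, GZIPOutputStream}

import org.scalatest.funsuite.AnyFunSuite

// Illustrative spec: round-trips a small JSON payload through GZIP,
// mirroring the pipeline's decompression step.
class GzipRoundTripSpec extends AnyFunSuite {
  test("decompressing a gzipped payload yields the original JSON") {
    val payload    = """{"ok":true}"""
    val compressed = new ByteArrayOutputStream()
    val gzOut      = new GZIPOutputStream(compressed)
    gzOut.write(payload.getBytes("UTF-8")); gzOut.close()

    val gzIn  = new GZIPInputStream(new ByteArrayInputStream(compressed.toByteArray))
    val bytes = Iterator.continually(gzIn.read()).takeWhile(_ != -1).map(_.toByte).toArray
    assert(new String(bytes, "UTF-8") == payload)
  }
}
```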
Run docker build -t etl-conagua-scala . to build the Docker image.
After building the image, run docker run -it --rm etl-conagua-scala; you'll see the results in the shell.
Run sbt assembly to build the JAR. This will also run the tests before building the JAR.
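The `sbt assembly` task relies on the sbt-assembly plugin being declared in `project/plugins.sbt`; a minimal sketch is shown below. The plugin version is an illustrative assumption, and wiring the tests to run before assembly depends on the project's own build settings.

```scala
// project/plugins.sbt — sbt-assembly enables the `assembly` task; the version here is an assumption.
addSbtPlugin("com.eed3si9n" % "sbt-assembly" % "2.1.5")
```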
Change the permissions of the shell script: chmod 777 spark-submit-script.sh
Run ./spark-submit-script.sh