Since scraping is increasingly disliked and excluded by most terms of use, I recommend a less automated process. Another hurdle is the new ZDF web-streaming application, which requires familiarity with Selenium or similar WebDriver tooling.
I'd recommend using Obsidian Web Clipper together with an AI provider's API endpoint. You can also host your own open-source model and use your own endpoints.
You can use my configs and customize them for your own vault or application.
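If you go the self-hosted route, it can help to verify that your endpoint is reachable and speaks an OpenAI-compatible API before wiring the clipper to it. Here is a minimal smoke test, assuming an Ollama instance on its default port; the host, port, and model name are assumptions for illustration, not part of this repo:

```sh
# Smoke-test a self-hosted, OpenAI-compatible endpoint.
# Ollama is shown as one example; host, port, and model are assumptions.
curl http://localhost:11434/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "llama3",
    "messages": [{"role": "user", "content": "Reply with OK if you can read this."}]
  }'
```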
- Install Python 3.12.* on your computer.
- Install lanzmining with pdm, poetry, or directly from [requirements.txt]; see the example commands below.
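For example, any one of the following should set up the environment (assuming a checkout of this repo and a working Python 3.12):

```sh
# Option 1: pdm
pdm install

# Option 2: poetry
poetry install

# Option 3: plain pip from the pinned requirements
pip install -r requirements.txt
```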
To finally obtain your dataset, run the lanzmining processors like so: `python src/main.py -c config/vault.json -o output.csv`.
If you want to change the parsing process later, you can create a snapshot of the raw data for later use: `python src/main.py -c config/vault.json -o output.csv -snapshot-file snapshots/snapshot.csv`. Back up your vault before applying the Obsidian Clipper templates.
To merge a snapshot with a new vault created from updated templates: `python src/main.py -c config/vault.json -o output.csv -merge-file snapshots/snapshot.csv`.
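Putting the two together, a typical update cycle (commands exactly as above) might look like:

```sh
# 1. Capture the raw data once, so reprocessing never depends on the live vault
python src/main.py -c config/vault.json -o output.csv -snapshot-file snapshots/snapshot.csv

# 2. After updating the clipper templates and rebuilding the vault,
#    merge the old snapshot back in
python src/main.py -c config/vault.json -o output.csv -merge-file snapshots/snapshot.csv
```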
All visualizations for the talk are built with D3.js, so I decided to wrap them in a Svelte project. If you're not familiar with Svelte, don't worry: all visualization code is written in plain JavaScript. You can find it at visuals/src/lib/visualisations.
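To explore the visualizations locally, a standard npm-based Svelte setup is usually started like this; the exact scripts depend on the project's package.json, so treat this as an assumption:

```sh
cd visuals
npm install
npm run dev   # default dev script for Vite/SvelteKit projects; may differ here
```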