explosion
diff --git a/‎.gitignore
Lines changed: 128 additions & 0 deletions b/‎.gitignore
Lines changed: 128 additions & 0 deletions
diff --git a/‎LICENSE
Lines changed: 21 additions & 0 deletions b/‎LICENSE
Lines changed: 21 additions & 0 deletions
diff --git a/‎README.md
Lines changed: 238 additions & 0 deletions b/‎README.md
Lines changed: 238 additions & 0 deletions
@@ -0,0 +1,128 @@
+.vscode
+.prettierrc
+
+# Byte-compiled / optimized / DLL files
+__pycache__/
+*.py[cod]
+*$py.class
+
+# C extensions
+*.so
+
+# Distribution / packaging
+.Python
+build/
+develop-eggs/
+dist/
+downloads/
+eggs/
+.eggs/
+lib/
+lib64/
+parts/
+sdist/
+var/
+wheels/
+pip-wheel-metadata/
+share/python-wheels/
+*.egg-info/
+.installed.cfg
+*.egg
+MANIFEST
+
+# PyInstaller
+#  Usually these files are written by a python script from a template
+#  before PyInstaller builds the exe, so as to inject date/other infos into it.
+*.manifest
+*.spec
+
+# Installer logs
+pip-log.txt
+pip-delete-this-directory.txt
+
+# Unit test / coverage reports
+htmlcov/
+.tox/
+.nox/
+.coverage
+.coverage.*
+.cache
+nosetests.xml
+coverage.xml
+*.cover
+*.py,cover
+.hypothesis/
+.pytest_cache/
+
+# Translations
+*.mo
+*.pot
+
+# Django stuff:
+*.log
+local_settings.py
+db.sqlite3
+db.sqlite3-journal
+
+# Flask stuff:
+instance/
+.webassets-cache
+
+# Scrapy stuff:
+.scrapy
+
+# Sphinx documentation
+docs/_build/
+
+# PyBuilder
+target/
+
+# Jupyter Notebook
+.ipynb_checkpoints
+
+# IPython
+profile_default/
+ipython_config.py
+
+# pyenv
+.python-version
+
+# pipenv
+#   According to pypa/pipenv#598, it is recommended to include Pipfile.lock in version control.
+#   However, in case of collaboration, if having platform-specific dependencies or dependencies
+#   having no cross-platform support, pipenv may install dependencies that don't work, or not
+#   install all needed dependencies.
+#Pipfile.lock
+
+# celery beat schedule file
+celerybeat-schedule
+
+# SageMath parsed files
+*.sage.py
+
+# Environments
+.env
+.venv
+env/
+venv/
+ENV/
+env.bak/
+venv.bak/
+
+# Spyder project settings
+.spyderproject
+.spyproject
+
+# Rope project settings
+.ropeproject
+
+# mkdocs documentation
+/site
+
+# mypy
+.mypy_cache/
+.dmypy.json
+dmypy.json
+
+# Pyre type checker
+.pyre/
@@ -0,0 +1,21 @@
+MIT License
+
+Copyright (c) 2020 ExplosionAI GmbH
+
+Permission is hereby granted, free of charge, to any person obtaining a copy
+of this software and associated documentation files (the "Software"), to deal
+in the Software without restriction, including without limitation the rights
+to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+copies of the Software, and to permit persons to whom the Software is
+furnished to do so, subject to the following conditions:
+
+The above copyright notice and this permission notice shall be included in all
+copies or substantial portions of the Software.
+
+THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+SOFTWARE.
@@ -0,0 +1,238 @@
+<a href="https://explosion.ai"><img src="https://explosion.ai/assets/img/logo.svg" width="125" height="125" align="right" /></a>
+
+# spacy-streamlit: spaCy comoponents for Streamlit
+
+This package contains utilities for visualizing [spaCy](https://spacy.io) models
+and building interactive spaCy-powered apps with
+[Streamlit](https://streamlit.io). It includes various building blocks you can
+use in your own Streamlit app, like visualizers for **syntactic dependencies**,
+**named entities**, **text classification**, **semantic similarity** via word
+vectors, token attributes, and more.
+
+[![Current Release Version](https://img.shields.io/github/release/explosion/spacy-streamlit.svg?style=flat-square&logo=github)](https://github.com/explosion/spacy-streamlit/releases)
+[![pypi Version](https://img.shields.io/pypi/v/spacy-streamlit.svg?style=flat-square&logo=pypi&logoColor=white)](https://pypi.org/project/spacy-streamlit/)
+
+<img width="50%" align="right" src="https://user-images.githubusercontent.com/13643239/85388081-f2da8700-b545-11ea-9bd4-e303d3c5763c.png">
+
+## 🚀 Quickstart
+
+You can install `spacy-streamlit` from pip:
+
+```bash
+pip install spacy-streamlit
+```
+
+The package includes **building blocks** that call into Streamlit and set up all
+the required elements for you. You can either use the individual components
+directly and combine them with other elements in your app, or call the
+`visualizer` function to embed the whole visualizer.
+
+```python
+# streamlit_app.py
+import spacy_streamlit
+
+models = ["en_core_web_sm", "en_core_web_md"]
+default_text = "Sundar Pichai is the CEO of Google."
+spacy_streamlit.visualizer(models, default_text))
+```
+
+You can then run your app with `streamlit run streamlit_app.py`.
+
+### 📦 Example: [`01_out-of-the-box.py`](examples/01_out-of-the-box.py)
+
+Use the embedded visualizer with custom settings out-of-the-box.
+
+```bash
+streamlit run https://raw.githubusercontent.com/explosion/spacy-streamlit/master/examples/01_out-of-the-box.py
+```
+
+### 👑 Example: [`02_custom.py`](examples/02_custom.py)
+
+Use individual components in your existing app.
+
+```bash
+streamlit run https://raw.githubusercontent.com/explosion/spacy-streamlit/master/examples/02_custom.py
+```
+
+## 🎛 API
+
+### Visualizer components
+
+These functions can be used in your Streamlit app. They call into `streamlit`
+under the hood and set up the required elements.
+
+#### <kbd>function</kbd> `visualizer`
+
+Embed the full visualizer with selected components.
+
+```python
+import spacy_streamlit
+
+models = ["en_core_web_sm", "/path/to/model"]
+default_text = "Sundar Pichai is the CEO of Google."
+visualizers = ["ner", "textcat"]
+spacy_streamlit.visualizer(models, default_text, visualizers)
+```
+
+| Argument              | Type                | Description                                                                                                            |
+| --------------------- | ------------------- | ---------------------------------------------------------------------------------------------------------------------- |
+| `models`              | List[str]           | Names of loadable spaCy models (paths or package names). The models become selectable via a dropdown.                  |
+| `default_text`        | str                 | Default text to analyze on load. Defaults to `""`.                                                                     |
+| `visualizers`         | List[str]           | Names of visualizers to show. Defaults to `["parser", "ner", "textcat", "similarity", "tokens"]`.                      |
+| `ner_labels`          | Optional[List[str]] | NER labels to include. If not set, all labels present in the `"ner"` pipeline component will be used.                  |
+| `ner_attrs`           | List[str]           | Span attributes shown in table of named entities. See [`visualizer.py`](spacy_streamlit/visualizer.py) for defaults.   |
+| `token_attrs`         | List[str]           | Token attributes to show in token visualizer. See [`visualizer.py`](spacy_streamlit/visualizer.py) for defaults.       |
+| `similarity_texts`    | Tuple[str, str]     | The default texts to compare in the similarity visualizer. Defaults to `("apple", "orange")`.                          |
+| `show_json_doc`       | bool                | Show button to toggle JSON representation of the `Doc`. Defaults to `True`.                                            |
+| `show_model_meta`     | bool                | Show button to toggle model `meta.json`. Defaults to `True`.                                                           |
+| `sidebar_title`       | Optional[str]       | Title shown in the sidebar. Defaults to `None`.                                                                        |
+| `sidebar_description` | Optional[str]       | Description shown in the sidebar. Accepts Markdown-formatted text.                                                     |
+| `show_logo`           | bool                | Show the spaCy logo in the sidebar. Defaults to `True`.                                                                |
+| `color`               | Optional[str]       | Experimental: Primary color to use for some of the main UI elements (`None` to disable hack). Defaults to `"#09A3D5"`. |
+
+#### <kbd>function</kbd> `visualize_parser`
+
+Visualize the dependency parse and part-of-speech tags using spaCy's
+[`displacy` visualizer](https://spacy.io/usage/visualizers).
+
+```python
+import spacy
+from spacy_streamlit import visualize_parser
+
+nlp = spacy.load("en_core_web_sm")
+doc = nlp("This is a text")
+visualize_parser(doc)
+```
+
+| Argument        | Type          | Description                                  |
+| --------------- | ------------- | -------------------------------------------- |
+| `doc`           | `Doc`         | The spaCy `Doc` object to visualize.         |
+| _keyword-only_  |               |                                              |
+| `title`         | Optional[str] | Title of the visualizer block.               |
+| `sidebar_title` | Optional[str] | Title of the config settings in the sidebar. |
+
+#### <kbd>function</kbd> `visualize_ner`
+
+Visualize the named entities in a `Doc` using spaCy's
+[`displacy` visualizer](https://spacy.io/usage/visualizers).
+
+```python
+import spacy
+from spacy_streamlit import visualize_ner
+
+nlp = spacy.load("en_core_web_sm")
+doc = nlp("Sundar Pichai is the CEO of Google.")
+visualize_ner(doc, labels=nlp.get_pipe("ner").labels)
+```
+
+| Argument        | Type          | Description                                                                   |
+| --------------- | ------------- | ----------------------------------------------------------------------------- |
+| `doc`           | `Doc`         | The spaCy `Doc` object to visualize.                                          |
+| _keyword-only_  |               |                                                                               |
+| `labels`        | Sequence[str] | The labels to show in the labels dropdown.                                    |
+| `attrs`         | List[str]     | The span attributes to show in entity table.                                  |
+| `show_table`    | bool          | Whether to show a table of entities and their attributes. Defaults to `True`. |
+| `title`         | Optional[str] | Title of the visualizer block.                                                |
+| `sidebar_title` | Optional[str] | Title of the config settings in the sidebar.                                  |
+
+#### <kbd>function</kbd> `visualize_textcat`
+
+Visualize text categories predicted by a trained text classifier.
+
+```python
+import spacy
+from spacy_streamlit import visualize_textcat
+
+nlp = spacy.load("./my_textcat_model")
+doc = nlp("This is a text about a topic")
+visualize_textcat(doc)
+```
+
+| Argument       | Type          | Description                          |
+| -------------- | ------------- | ------------------------------------ |
+| `doc`          | `Doc`         | The spaCy `Doc` object to visualize. |
+| _keyword-only_ |               |                                      |
+| `title`        | Optional[str] | Title of the visualizer block.       |
+
+#### `visualize_similarity`
+
+Visualize semantic similarity using the model's word vectors. Will show a
+warning if no vectors are present in the model.
+
+```python
+import spacy
+from spacy_streamlit import visualize_similarity
+
+nlp = spacy.load("en_core_web_lg")
+visualize_similarity(nlp, ("pizza", "fries"))
+```
+
+| Argument        | Type            | Description                                                                                                                                          |
+| --------------- | --------------- | ---------------------------------------------------------------------------------------------------------------------------------------------------- |
+| `nlp`           | `Language`      | The loaded `nlp` object with vectors.                                                                                                                |
+| `default_texts` | Tuple[str, str] | The default texts to compare on load. Defaults to `("apple", "orange")`.                                                                             |
+| _keyword-only_  |                 |                                                                                                                                                      |
+| `threshold`     | float           | Threshold for what's considered "similar". If the similarity score is greater than the threshold, the result is shown as similar. Defaults to `0.5`. |
+| `title`         | Optional[str]   | Title of the visualizer block.                                                                                                                       |
+
+#### <kbd>function</kbd> `visualize_tokens`
+
+Visualize the tokens in a `Doc` and their attributes.
+
+```python
+import spacy
+from spacy_streamlit import visualize_tokens
+
+nlp = spacy.load("en_core_web_sm")
+doc = nlp("This is a text")
+visualize_tokens(doc, atrrs=["text", "pos_", "dep_", "ent_type_"])
+```
+
+| Argument       | Type          | Description                                                                                              |
+| -------------- | ------------- | -------------------------------------------------------------------------------------------------------- |
+| `doc`          | `Doc`         | The spaCy `Doc` object to visualize.                                                                     |
+| _keyword-only_ |               |                                                                                                          |
+| `attrs`        | List[str]     | The names of token attributes to use. See [`visualizer.py`](spacy_streamlit/visualizer.py) for defaults. |
+| `title`        | Optional[str] | Title of the visualizer block.                                                                           |
+
+### Cached helpers
+
+These helpers attempt to cache loaded models and created `Doc` objects.
+
+#### <kbd>function</kbd> `process_text`
+
+Process a text with a model of a given name and create a `Doc` object. Calls
+into the `load_model` helper to load the model.
+
+```python
+import streamlit as st
+from spacy_streamlit import process_text
+
+spacy_model = st.sidebar.selectbox("Model name", ["en_core_web_sm", "en_core_web_md"])
+text = st.text_area("Text to analyze", "This is a text")
+doc = process_text(spacy_model, text)
+```
+
+| Argument     | Type  | Description                                             |
+| ------------ | ----- | ------------------------------------------------------- |
+| `model_name` | str   | Loadable spaCy model name. Can be path or package name. |
+| `text`       | str   | The text to process.                                    |
+| **RETURNS**  | `Doc` | The processed document.                                 |
+
+#### <kbd>function</kbd> `load_model`
+
+Load a spaCy model from a path or installed package and return a loaded `nlp`
+object.
+
+```python
+import streamlit as st
+from spacy_streamlit import load_model
+
+spacy_model = st.sidebar.selectbox("Model name", ["en_core_web_sm", "en_core_web_md"])
+nlp = load_model(spacy_model)
+```
+
+| Argument    | Type       | Description                                              |
+| ----------- | ---------- | -------------------------------------------------------- |
+| `name`      | str        |  Loadable spaCy model name. Can be path or package name. |
+| **RETURNS** | `Language` | The loaded `nlp` object.                                 |