Skip to content

Commit ea9ee81

Browse files
committed
typo
1 parent e3f196e commit ea9ee81

File tree

2 files changed

+13
-7
lines changed

2 files changed

+13
-7
lines changed

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -603,7 +603,7 @@ This notebooks in this section shows how you can load data from various sources
603603
</a>
604604
</td>
605605
<td rowspan="4">
606-
<b> 🧠 TIMM Embeddings:</b> Compute your embeddings using <a href="https://github.com/huggingface/pytorch-image-models">TIMM (PyTorch Image Models)</a> and run fastdup over the them to analyze for issues. model. Runs on CPU and GPU.
606+
<b> 🧠 TIMM Embeddings:</b> Compute dataset embeddings using <a href="https://github.com/huggingface/pytorch-image-models">TIMM (PyTorch Image Models)</a> and run fastdup over the them to surface dataset issues. Runs on CPU and GPU.
607607
</td>
608608
<td align="center" width="80">
609609
<a href="https://nbviewer.org/github/visual-layer/fastdup/blob/main/examples/embeddings-timm.ipynb">

examples/embeddings-timm.ipynb

Lines changed: 12 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -62,7 +62,7 @@
6262
"\n",
6363
"First, let's install the neccessary packages:\n",
6464
"\n",
65-
"- fastdup - To analyze issues in the dataset.\n",
65+
"- [fastdup](https://github.com/visual-layer/fastdup) - To analyze issues in the dataset.\n",
6666
"- [TIMM (PyTorch Image Models)](https://github.com/huggingface/pytorch-image-models) - To acquire pre-trained models."
6767
]
6868
},
@@ -130,7 +130,11 @@
130130
"metadata": {},
131131
"source": [
132132
"## List TIMM Models\n",
133-
"There are over a thousand models on TIMM. Let's list down models that match the keyword `dino`."
133+
"There are currently 1212 computer vision models on TIMM. Pick a model of your choice to compute the embedding with.\n",
134+
"\n",
135+
"Now, pick a model of your choice. For demonstration, we will go with a relatively new model `vit_small_patch14_dinov2.lvd142m` from MetaAI. \n",
136+
"\n",
137+
"Let's list down models that match the keyword `dino`."
134138
]
135139
},
136140
{
@@ -171,8 +175,6 @@
171175
"id": "633dce0c-47eb-4039-8cd4-a36874c49b8a",
172176
"metadata": {},
173177
"source": [
174-
"Now, pick a model of your choice. For demonstration, we will go with a relatively new model `vit_small_patch14_dinov2.lvd142m` from MetaAI. \n",
175-
"\n",
176178
"DINOv2 models produce high-performance visual features that can be directly employed with classifiers as simple as linear layers on a variety of computer vision tasks; these visual features are robust and perform well across domains without any requirement for fine-tuning. Read more about DINOv2 [here](https://github.com/facebookresearch/dinov2).\n",
177179
"\n",
178180
"It makes sense for us to use DINOv2 as a model to create an embedding of the dataset."
@@ -288,7 +290,7 @@
288290
"source": [
289291
"## Run fastdup\n",
290292
"\n",
291-
"Now what's left is to load the embeddings into fastdup and run an analysis to surface dataset issues."
293+
"Now let's load the embeddings into fastdup and run an analysis to surface dataset issues."
292294
]
293295
},
294296
{
@@ -2467,7 +2469,11 @@
24672469
"metadata": {},
24682470
"source": [
24692471
"## Wrap Up\n",
2470-
"In this tutorial, we showed how you can run fastdup using pre-computed feature vectors. Running over pre-computed feature vectors significantly reduces run time compared to running over raw image files.\n",
2472+
"In this tutorial, we showed how you can compute embeddings on your dataset using TIMM and run fastdup on top of it to surface dataset issues.\n",
2473+
"\n",
2474+
"Questions about this tutorial? Reach out to us on our [Slack channel](https://visuallayer.slack.com/)!\n",
2475+
"\n",
2476+
"\n",
24712477
"\n",
24722478
"Next, feel free to check out other tutorials -\n",
24732479
"\n",

0 commit comments

Comments
 (0)