refactor: upgrade examples to release 2.0
jakobkruse1 opened this issue · 12 comments
We want to know how examples will run with Jina 2.0.
The following tasks should be finished for each of them:
- Use Jina 2.0 interface
- C&P Executors from Hub and adopt to new Executor base class
- Adopt Readme according to the Example Template #518
- Add comments to YAML files to make them better understandable #445
Examples to be updated:
- wikipedia-sentences #619
- pokedex #626
- multi modal PDF #629
- multires-lyrics #648 Sebastian, Jakob
- cross-model #631
- query-while-indexing (only once dump-load approach has been agreed upon) Kelton: #665
Other tasks:
- Move all other examples to legacy and update example README.
- Create tutorials
We might continue with more afterwards.
As discussed in meeting today, I'd recommend merging these uni-modal (mono-modal?) examples into pokedex, and possibly renaming the example to "image/audio/video search" or similar:
- audio search
- tumblr gif search
Also, if we can get a decent metadata-heavy text dataset with good chunkability and searchability, we could look at merging the text-based examples (wikipedia-*, multires-lyrics) in future
Might also be good to rename 'wikipedia-sentences' to plain old 'text-search'. Makes it more google-able, and if/when we change the dataset we can retain the old URL
Instead of working on cross-modal
, I suggest starting with cross-modal-pdf
, which is using smaller model and less data. Plus, it is equipped with proper tests.
I share the same view with @alexcg1, we need to rename it just example-audio-search
, example-video-search
, example-text-search
. As we are moving to 2.0
, I suggest pushing the proposal we had before that we should split the examples into separated repos.
Do we have a list somewhere of
- what we're updating?
- what we're merging?
- what we're deleting?
If not, I'll create a table in this thread and we can discuss
I share the same view with @alexcg1, we need to rename it just
example-audio-search
,example-video-search
,example-text-search
. As we are moving to2.0
, I suggest pushing the proposal we had before that we should split the examples into separated repos.
I suggest we wait until all other example work is finished before doing this. If we change names mid-stream people are going to have a nightmare properly merging
@nan-wang @CatStark @florian-hoenicke Some of these examples have been upgraded on an older version of jina2.0. We should use hub executors in all examples and update all of them to the latest jina version. Do you agree?
I really like the idea of using Hub Executors - many folks learn Jina by adapting the examples, so if Hub is baked in then they'll keep using Hub stuff (except for when they need to build their own Executors, which they can then push to Hub 😄 )
We should use hub executors in all examples and update all of them to the latest jina version. Do you agree?
Sure, let's do it.