Skip to content

feat: tensor_type for all DocVec serializations #1679

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 13 commits into from
Jul 26, 2023
Merged

Conversation

JohannesMessner
Copy link
Member

@JohannesMessner JohannesMessner commented Jun 28, 2023

closes #1677

The approach is to create a separate IO Mixin class for DocVec that takes a tensor_type argument for all deserialization operations.

  • proto
  • json
  • pandas
  • bytes
    • pickle-array
  • binary
    • streaming streaming return individual docs, so no tensor_type needed
  • base64
  • documentation

@codecov
Copy link

codecov bot commented Jul 24, 2023

Codecov Report

Patch coverage: 37.38% and project coverage change: -57.91% ⚠️

Comparison is base (9c95f3a) 88.56% compared to head (5a7318c) 30.65%.

Additional details and impacted files
@@             Coverage Diff             @@
##             main    #1679       +/-   ##
===========================================
- Coverage   88.56%   30.65%   -57.91%     
===========================================
  Files         155      133       -22     
  Lines        8061     8563      +502     
===========================================
- Hits         7139     2625     -4514     
- Misses        922     5938     +5016     
Flag Coverage Δ
docarray 30.65% <37.38%> (-57.91%) ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files Changed Coverage Δ
docarray/base_doc/docarray_response.py 0.00% <0.00%> (ø)
docarray/base_doc/mixins/update.py 14.60% <ø> (ø)
docarray/computation/__init__.py 100.00% <ø> (ø)
docarray/computation/abstract_comp_backend.py 76.92% <ø> (ø)
...carray/computation/abstract_numpy_based_backend.py 72.97% <ø> (ø)
docarray/computation/numpy_backend.py 41.48% <ø> (ø)
docarray/computation/tensorflow_backend.py 37.19% <ø> (ø)
docarray/computation/torch_backend.py 45.21% <ø> (ø)
docarray/data/__init__.py 0.00% <ø> (ø)
docarray/data/torch_dataset.py 0.00% <ø> (ø)
... and 87 more

... and 37 files with indirect coverage changes

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

Signed-off-by: Johannes Messner <[email protected]>
Signed-off-by: Johannes Messner <[email protected]>
Signed-off-by: Johannes Messner <[email protected]>
Signed-off-by: Johannes Messner <[email protected]>
Signed-off-by: Johannes Messner <[email protected]>
@JohannesMessner JohannesMessner marked this pull request as ready for review July 25, 2023 14:50
Signed-off-by: Johannes Messner <[email protected]>
@github-actions github-actions bot added size/xl and removed size/l labels Jul 25, 2023
@github-actions
Copy link

This PR exceeds the recommended size of 1000 lines. Please make sure you are NOT addressing multiple issues with one PR. Note this PR might be rejected due to its size.

@github-actions
Copy link

📝 Docs are deployed on https://ft-tensor-type-deser--jina-docs.netlify.app 🎉

@JoanFM JoanFM merged commit efeab90 into main Jul 26, 2023
@JoanFM JoanFM deleted the tensor-type-deser branch July 26, 2023 02:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

tensor_type argument for all DocVec deserializations
2 participants