docetl2/docs
Shreya Shankar bcac6872f5
Embedding blocking threshold optimization (#473)
* feat: Add runtime blocking threshold optimization

Co-authored-by: ss.shankar505 <ss.shankar505@gmail.com>

* Checkpoint before follow-up message

* Refactor: Simplify target_recall retrieval in Equijoin and Resolve

* Refactor: Improve blocking documentation and add auto-blocking

* allow resolve and equijoin to figure out blocking thresholds on the fly.

---------

Co-authored-by: Cursor Agent <cursoragent@cursor.com>
2025-12-29 19:47:21 -06:00
advanced
api-reference fix: add code ops and extract to python api (#463) 2025-11-24 14:18:59 -06:00
assets chore: switching to cloudbank for blob storage (#400) 2025-07-31 14:28:00 -07:00
community
concepts Add MOAR optimizer to docetl (#464) 2025-11-28 13:47:42 -06:00
examples Implement LiteLLM fallback models for reliability (#453) 2025-11-14 13:17:47 -08:00
execution
operators Embedding blocking threshold optimization (#473) 2025-12-29 19:47:21 -06:00
optimization Fast Decomposition for Map Operations in DocWrangler (#472) 2025-12-29 18:22:02 -06:00
pandas Update Pandas API to Use New Output Parameter Format (#409) 2025-08-13 10:41:49 -07:00
playground
python
stylesheets
best-practices.md
index.md feat: add claude-code skill (#469) 2025-12-27 22:23:30 -06:00
installation.md feat: add claude-code skill (#469) 2025-12-27 22:23:30 -06:00
quickstart-claude-code.md feat: add claude-code skill (#469) 2025-12-27 22:23:30 -06:00
retrievers.md feat: add claude-code skill (#469) 2025-12-27 22:23:30 -06:00
tutorial-pythonapi.md fix: add code ops and extract to python api (#462) 2025-11-24 14:10:07 -06:00
tutorial.md fix: add code ops and extract to python api (#463) 2025-11-24 14:18:59 -06:00