* feat: Add output type validation to map and reduce operations
Co-authored-by: ss.shankar505 <ss.shankar505@gmail.com>
* Refine API error message for validation failures
Co-authored-by: ss.shankar505 <ss.shankar505@gmail.com>
* Test map type validation for integer answers
Co-authored-by: ss.shankar505 <ss.shankar505@gmail.com>
* fix test for validating schemas
---------
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
* Switch from Poetry to uv for dependency management and packaging
Co-authored-by: ss.shankar505 <ss.shankar505@gmail.com>
* Update README with uv installation and dependency management instructions
Co-authored-by: ss.shankar505 <ss.shankar505@gmail.com>
* Optimize Docker build: improve dependency installation and caching
Co-authored-by: ss.shankar505 <ss.shankar505@gmail.com>
* verify that uv works on my local installation
---------
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
* Remove old typing imports
* Add future annotation
* Add back in import
* Use Iterator instead of Iterable
* fix: small edits to fix broken tests
---------
Co-authored-by: Shreya Shankar <ss.shankar505@gmail.com>
* Replace api_wrapper with runner in test fixtures and configurations
Co-authored-by: ss.shankar505 <ss.shankar505@gmail.com>
* Refactor test fixtures and reorganize configuration in test files
Co-authored-by: ss.shankar505 <ss.shankar505@gmail.com>
---------
Co-authored-by: Cursor Agent <cursoragent@cursor.com>
* fix: improve caching and don't raise error for bad gather configs
* fix: improve caching and don't raise error for bad gather configs
* feat: adding conditional gleaning
* feat: add flag to stream map operation outputs to disk
* flush partial results default False
* comment out print statement in test basic map
* rewrite test to not use batched config
---------
Co-authored-by: Shreya Shankar <ss.shankar505@gmail.com>
* partial commit
* refactor: dslrunner is now a pull based execution model
* refactor: dslrunner is now a pull based execution model
* refactor: optimizer is now using the new pull based execution model
* refactor: optimizer is now using the new pull based execution model
* refactor: optimizer is now using the new pull based execution model
* remove builder file
* remove builder file and make tests pass
* fix tests
* added enum support
* tests: add test for enum type output
* docs: update docs to support enum type schemas
---------
Co-authored-by: Shreya Shankar <ss.shankar505@gmail.com>
* Parsers can now return any number of fields, and can access the whole item
* nit: change gpt-4o to gpt-4o-mini in tests
* feat: add verbose parameter for gleaning
* feat: add verbose parameter for gleaning
* fix: tokenizers should be wrapped in try catch
* fix: resort to eval if ast eval does not work
* docs: update docs to reflect new custom parsing API
Co-authored-by: redhog <redhog@users.noreply.github.com>
* Clustering (#84)
* nit: change gpt-4o to gpt-4o-mini in tests
* feat: add verbose parameter for gleaning
* feat: add verbose parameter for gleaning
* fix: tokenizers should be wrapped in try catch
* fix: resort to eval if ast eval does not work
* Merge staging to main (after parsers refactor) (#82)
* Parsers can now return any number of fields, and can access the whole item
* nit: change gpt-4o to gpt-4o-mini in tests
* feat: add verbose parameter for gleaning
* feat: add verbose parameter for gleaning
* fix: tokenizers should be wrapped in try catch
* fix: resort to eval if ast eval does not work
* docs: update docs to reflect new custom parsing API
---------
Co-authored-by: Egil <egil.moller@freecode.no>
* Added new clustering operation
* Reverse path
* Added docs for cluster operator
* Bugfix for docs formatting
* docs: add sample parameter (#87)
* Added new clustering operation
* Reverse path
* Added docs for cluster operator
* Bugfix for docs formatting
* add tests and link to doc
---------
Co-authored-by: Shreya Shankar <ss.shankar505@gmail.com>
Co-authored-by: Egil <egil.moller@freecode.no>
* fix: fixing params in test
---------
Co-authored-by: Egil <egil.moller@freecode.no>
Co-authored-by: redhog <redhog@users.noreply.github.com>
Co-authored-by: Egil Möller <redhog@redhog.org>
* Parsers can now return any number of fields, and can access the whole item
* nit: change gpt-4o to gpt-4o-mini in tests
* feat: add verbose parameter for gleaning
* feat: add verbose parameter for gleaning
* fix: tokenizers should be wrapped in try catch
* fix: resort to eval if ast eval does not work
* docs: update docs to reflect new custom parsing API
---------
Co-authored-by: Egil <egil.moller@freecode.no>