24.01 Release notes

Instabase 24.01 is a major release that introduces new features, enhancements, and bug fixes.

Subsequent patch releases typically contain bug fixes along with testing, optimizations, security fixes, and other internal changes that do not change user functionality.

Release 24.01.69

February 5, 2025

This release contains no changes to user functionality.

Release 24.01.68

February 1, 2025

  • Resolved [CVE-2024-47535] | Upgraded the netty-common package to version 4.1.115.

Release 24.01.67

January 29, 2025

Release 24.01.66

January 24, 2025

Release 24.01.65

January 23, 2025

This release contains no changes to user functionality.

Release 24.01.64

January 22, 2025

Release 24.01.63

  • Resolved [CVE-2024-52804] | Updated the Python tornado package to version 6.4.2.

Release 24.01.62

January 9, 2025

This release contains no changes to user functionality.

Release 24.01.62

December 9, 2024

The image tag for this release was updated.

Release 24.01.61

December 9, 2024

  • Resolved GHSA-m425-mq94-257g | Updated the grpc module to version 1.57.1.

  • Resolved [CVE-2024-52804] | Updated tornado to version 6.4.2.

  • Resolved [CVE-2024-24786] | Updated the protobuf packate to version 1.33.0.

  • Resolved [CVE-2024-4671] | Updated the playwright package.

  • Resolved [CVE-2024-45337] | Updated the crypto package to version 0.31.0.

  • Resolved [CVE-2024-56326] | Upgraded the jinja package to version 3.1.5.

[CS-5061] Fix file-sevice MAX_CONCURRENT_STREAMS behavior in GRPC 1.57.1+ by @PV99 in #66791

  • The OCR service crashed if it encountered many concurrent requests.

  • Resolved CVE-2023-28858, CVE-2023-28859 | Updated the redis package to version 4.5.4.

Release 24.01.60

December 11, 2024

Release 24.01.59

December 11, 2024

Release 24.01.58

December 4, 2024

Release 24.01.57

November 28, 2024

  • Batch requests to the model service experienced transient errors.

  • Resolved [CVE-2024-5569] | Updated the zipp package to 3.19.1.

  • Resolved [CVE-2024-24790] | Update go.mod version to 1.22.9.

  • Resolved [CVE-2024-37891] | Updated the urllib3 package to version 1.26.20.

  • Resolved [CVE-2024-41131] | Updated the form-recognizer package to 2022-08-31.20241107.1-14d4cf9e.

  • Resolved [CVE-2022-30187] Updated azure-storage-blob package to version 12.13.0.

Release 24.01.56

November 20, 2024

Release 24.01.55

November 14, 2024

  • Tasks could be marked as failed when api-server pods were temporarily overloaded. If this occurs now, the task is retried.

  • An error sometimes occurred with core platform replicas.

  • Resolved [CVE-2024-7254]

  • Resolved [CVE-2024-5452] | Upgraded pytorch-lightning package to version 2.4.

  • Resolved [CVE-2024-28122] | Upgraded package jwx to version 1.2.29.

  • Security | Upgraded /nats-server/v2 package to version 1.17.11.

Release 24.01.54

November 6, 2024

  • Job count, which was previously removed from the Flow Review due to performance issues, has been reinstated, but you must enable it in Admin > Configuration

Release 24.01.53

October 31, 2024

  • Resolved [CVE-2022-1996] Updated go-restful to 2.16.0.

  • Security | Upgraded prometheus to v2.55.0.

Release 24.01.52

October 30, 2024

Release 24.01.51

October 23, 2024

  • You can now use uppercase letters in your username, service account names, and space names.

  • Resolved [CVE-2024-31583] | Upgraded the torch package.

  • Resolved [CVE-2024-6345] | Upgraded the setuptools package.

  • Resolved [CVE-2024-4603] | Upgraded Redis and cryptography.

Release 24.01.50

October 16, 2024

  • Production performance was improved in cases where the Instabase clusters were affected by poor performance by the Redis deployment.

  • Rarely, reading a 0-length file with the Text Editor failed.

Release 24.01.49

October 9, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.48

October 9, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.47

October 3, 2024

  • An issue in an third-party library caused breakages in an internal LLM.

Release 24.01.46

September 27, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.45

September 18, 2024

  • Execution of flows with Reduce UDF steps in parallel branches encountered flow execution errors.

Release 24.01.44

September 12, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.43

September 5, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.42

August 29, 2024

  • For flows with checkpoints following an STP checkpoint, input records were incorrectly tracked, resulting in missing documents.

  • The site setting Restricted file upload extensions now also prevents downloading files with the specified extension name or renaming a file to that name.

Release 24.01.41

August 19, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.40

August 5, 2024

  • When running a flow, the out folder in the selected output directory was not correctly cleaned up when the delete_out_dir was enabled.

Release 24.01.39

July 29, 2024

  • You can now make a Solution Builder project’s Resources directory accessible with custom code.

Release 24.01.38

July 29, 2024

  • Improved OCR text extraction and preprocessing of documents in ML Studio and the Reader app.

Release 24.01.37

July 19, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.36

July 17, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.35

July 10, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.34

July 8, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.33

June 28, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.32

June 27, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.31

June 27, 2024

  • If a reviewer opens a job assigned to a different reviewer, the flow review results now correctly show a warning banner for that job.

  • Encoded information from email headers, such as from, to, subject, and date, is now correctly extracted.

Release 24.01.30

June 20, 2024

  • In some cases, model service pods got stuck retrying on an entity detection model, blocking the model service from taking new requests.

  • The size of .ibdoc could grow too large if you performed file splitting in ML Studio with barcode detection enabled.

Release 24.01.29

June 12, 2024

  • In flows with multiple split classifiers in the same path using the “Split PDF” option, generated PDFs were sometimes incorrectly split.

Release 24.01.28

June 7, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.27

June 6, 2024

  • PDF attachments weren’t assigned appropriate names during process files steps.

  • Job count, which was previously removed from the Flow Dashboard due to performance issues, has been reinstated and appears by default. You can disable job count at Admin > Configuration if you notice performance impacts in large-scale deployments.

Release 24.01.26

June 3, 2024

  • The job service sometimes erroneously requeued completed tasks.

Release 24.01.25

May 29, 2024

This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.

Release 24.01.24

May 21, 2024

  • Processing of large XLSX or XLS files can now be improved with preprocessing them, which removes all empty rows and columns. Preprocessing simplifies XLSX files and improves performance; however, it also removes all special text formatting, such as color or bold text. If text styling or images are critical to your results, do not preprocess your files.

    To enable preprocessing, select Preprocess Excel files in your OCR configuration settings.

Release 24.01.23

May 15, 2024

  • Flows did not generate results if one of the completed tasks didn’t have the flow_results field set.

Release 24.01.22

May 8, 2024

  • When you made a change in the human review screen, the original value incorrectly showed as empty.

Release 24.01.21

May 1, 2024

  • When resuming a flow during human review, it erroneously retried step errors.

Release 24.01.20

  • Unused tags have been removed from the Flow Dashboard.

Release 24.01.19

  • You can now include multiple apply classifier steps in flows. For example, you can use a first split classifier step to classify documents into general types, like identity documents and tax forms. Then, you can use a second split classifier step to subdivide these general types into more specific types, like passports and driver’s licenses.

Release 24.01.18

This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.

Release 24.01.17

This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.

Release 24.01.16

This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.

  • The schema for the other class wasn’t applied to unclassified documents as intended.

Release 24.01.15

This version was not released.

Release 24.01.14

This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.

Release 24.01.13

This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.

Release 24.01.12

  • Source data is more accurately identified for extraction prompts due to improved OCR token matching.

  • When resuming a job from Flow Review, the status did not update if you stayed on the Flow Review screen.

Release 24.01.11

  • From the Flow Review dashboard, you can now bulk assign jobs to a reviewer.

Release 24.01.10

  • You can now visualize the progress of your flows on a file-by-file basis. Click into a file in your flow, and you can see any long-running steps that delay the completion of their flow.

Release 24.01.9

This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.

Release 24.01.8

  • Annotations weren’t carried over when importing an annotation set from the file system into Solution Builder, or when exporting an ML Studio dataset in the same environment.

Release 24.01.7

This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.

Release 24.01.6

  • In ML Studio, the field list failed to load in certain circumstances.

Release 24.01.5

  • ImageMagick height and width limit is increased from 16K to 20K. Additionally, new environment variables in celery-app-tasks control ImageMagick height, width, and area: MAGICK_HEIGHT_LIMIT, MAGICK_WIDTH_LIMIT, MAGICK_AREA_LIMIT.

Release 24.01.4

This is the first generally available release of Instabase 24.01.

New features

Flow Review

  • Public Preview | Document-based review introduces a new method for managing human reviews where reviews are broken down into individual documents. This method provides more granularity in review assignment, and more flexibility for reviewers, because you can assign, review, or escalate individual documents, or open multiple documents from different flow jobs for review. Flows must be resumed as a separate step when all associated documents are marked reviewed.

    Document-based reviews are supported only for flows that are managed as a deployed solution. Document-based reviews can’t be exported, and they’re not included in metrics reported in the Solution Dashboard.

Enhancements

Deployment Manager

  • You can now use the Update base config image API to override the defined registry. The change persists until a new base config is applied for the specified service, at which point the image uses the registry defined in the Installer.

  • From the Infra Dashboard, you can now view the live Kubernetes configuration for a given Kubernetes element. See the element’s Config tab for the materialized configuration and the Live K8s Config tab for the live configuration.

  • Network policies have been added for the jobs underlying the Test Runner test post-install action (solution-diagnostic-artifacts) and the base model image upload post-install action (mktplaceupdate). Previously, if your security settings required network policies for all jobs, these actions wouldn’t run. mTLS traffic is not currently supported when running these jobs.

Observability

  • The legacy Prometheus service has been succeeded by the VictoriaMetrics vmagent service. The vmagent service is significantly better optimized, boasting up to a 7x improvement in memory optimization and approximately 2x improvement in CPU optimization, compared to Prometheus.

    • The vmagent service supports horizontal scaling, allowing you to expand its capacity by adding additional instances.

    • The vmagent service is deployed as a StatefulSet service.

    • Rollback procedures are included as a contingency measure in the event of any issues.

  • VictoriaMetrics vmalert service now handles the evaluation of alerting rules. This service operates as a sidecar alongside VictoriaMetrics, enhancing the alerting capabilities of the system.

Flow

  • Improvements to the Flow job service provide faster startup for running flow jobs, reduced memory consumption, and increased queue capacity. Specifically, internal testing demonstrated a 5x improvement in startup time and the ability to queue up to 18x more jobs.

  • The Python library pyxlsb is now available for use in UDFs. This library enables importing .xlsb files in a pre-flow UDF.

Flow Review

  • You can now optionally display provenance only for the selected field by enabling Only Show Selected Field Annotations in field list options.

Flow Dashboard

  • Public Preview | Flow logs now include a visualization of processing time for each file and step in the job, so you can quickly identify slowdowns. You can drill in to each file to further visualize time taken by each step for a specific file. To access flow execution visualizations, from Flow Dashboard, click Logs for any job, then select the Visualize tab.

  • You can now view and filter by priority in Flow Dashboard. A new Priority column can be enabled in Flow Dashboard settings, and in the filter sidebar, you can filter by a specific priority. Flow priority can be set when using the API to run a flow.

  • The tag search filter in Flow Dashboard, as well as the jobs/list API endpoint, is now case sensitive.

Bug fixes

Flow

  • License usage is more accurately counted for flows that include apply checkpoint steps. Previously, usage was counted only when a flow completed.

Flow Review

  • When you used the pop-out icon to open a document in a separate window, the first file in the file list was displayed, rather than the selected document.

ML Studio

  • If portions of a date were missing, the format_date function inserted random date values. For example, if the date was 11/2020, the function would generate a random date value and return 11/01/2020.

Deprecations and removals

See the deprecations and removals page for the latest announcements, as well as previous deprecations and removals.

Deployment guide

  • Workload autoscaling was introduced as a public preview feature in release 23.07 and remains in public preview. As a public preview feature, workload autoscaling is disabled by default. Workload autoscaling has several infrastructure requirements. For instructions on enabling workload autoscaling during your upgrade, see the workload autoscaling feature documentation.