24.01 Release notes
Instabase 24.01 is a major release that introduces new features, enhancements, and bug fixes.
Subsequent patch releases typically contain bug fixes along with testing, optimizations, security fixes, and other internal changes that do not change user functionality.
Release 24.01.69
February 5, 2025
This release contains no changes to user functionality.
Release 24.01.68
February 1, 2025
- Resolved [CVE-2024-47535] | Upgraded the
netty-common
package to version 4.1.115.
Release 24.01.67
January 29, 2025
-
Resolved [CVE-2024-47554] | Updated the
commons-io:commons-io
package to version 2.14.0. -
Resolved [CVE-2024-8096] | Updated
jaeger-agent
to 1.62.0. -
Resolved [CVE-2024-8309] | Updated the
langchain
package to version 0.2.5.
Release 24.01.66
January 24, 2025
- Resolved [CVE-2024-45337] | Updated the
crypto
package to version 0.31.0.
Release 24.01.65
January 23, 2025
This release contains no changes to user functionality.
Release 24.01.64
January 22, 2025
-
Updated Grafana to version 11.4.0.
-
Resolved [CVE-2024-56326] | Update
jinja
package to version 3.1.5. -
Resolved [CVE-2024-38820], [CVE-2024-38827] | Updated the
springframework#spring-context
library to version 6.1.14.
Release 24.01.63
- Resolved [CVE-2024-52804] | Updated the Python
tornado
package to version 6.4.2.
Release 24.01.62
January 9, 2025
This release contains no changes to user functionality.
Release 24.01.62
December 9, 2024
The image tag for this release was updated.
Release 24.01.61
December 9, 2024
-
Resolved GHSA-m425-mq94-257g | Updated the
grpc
module to version 1.57.1. -
Resolved [CVE-2024-52804] | Updated
tornado
to version 6.4.2. -
Resolved [CVE-2024-24786] | Updated the
protobuf
packate to version 1.33.0. -
Resolved [CVE-2024-4671] | Updated the
playwright
package. -
Resolved [CVE-2024-45337] | Updated the
crypto
package to version 0.31.0. -
Resolved [CVE-2024-56326] | Upgraded the
jinja
package to version 3.1.5.
[CS-5061] Fix file-sevice MAX_CONCURRENT_STREAMS behavior in GRPC 1.57.1+ by @PV99 in #66791
-
The OCR service crashed if it encountered many concurrent requests.
-
Resolved CVE-2023-28858, CVE-2023-28859 | Updated the
redis
package to version 4.5.4.
Release 24.01.60
December 11, 2024
- Resolved [CVE-2023-28858] | Updated the
redis
package to version 4.5.3.
Release 24.01.59
December 11, 2024
-
Resolved [GHSA-78wr-2p64-hpwj] | Updated the
ray
package to version 2.39.0. -
Resolved [CVE-2024-3095] | Updated the
langchain
package to version 0.2.10. -
Resolved [CVE-2024-49767] | Updated
werkzeug
to version 3.0.6. -
Resolved [CVE-2024-5187] | Updated
onnx
package to version 1.17.0. -
Resolved [GHSA-h4gh-qq45-vh27] | Updated
cryptography
package to version 43.0.1.
Release 24.01.58
December 4, 2024
- Resolved [CVE-2024-49767] | Updated
werkzeug
package to version 3.0.6.
Release 24.01.57
November 28, 2024
-
Batch requests to the model service experienced transient errors.
-
Resolved [CVE-2024-5569] | Updated the
zipp
package to 3.19.1. -
Resolved [CVE-2024-24790] | Update
go.mod
version to 1.22.9. -
Resolved [CVE-2024-37891] | Updated the
urllib3
package to version 1.26.20. -
Resolved [CVE-2024-41131] | Updated the
form-recognizer
package to 2022-08-31.20241107.1-14d4cf9e. -
Resolved [CVE-2022-30187] Updated
azure-storage-blob
package to version 12.13.0.
-
Resolved [CVE-2024-52304] | Updated
aiohttp
package to version 3.10.11. -
Resolved [[CVE-2024-49768]] (https://nvd.nist.gov/vuln/detail/CVE-2024-49768) | Updated the
waitress
package to version 3.0.1.
Release 24.01.56
November 20, 2024
-
Resolved [CVE-2024-7254] | Updated opensearch version to 2.18. <!–VTEST–47373–>
-
Resolved [CVE-2024-7254], [CVE-2024-47554] | Fixed vulnerability issues for
protoc
andcommons.io
in the conversion service. -
Resolved [CVE-2024-38808] | Update
spring-expression
to 5.3.39.
Release 24.01.55
November 14, 2024
-
Tasks could be marked as failed when
api-server
pods were temporarily overloaded. If this occurs now, the task is retried. -
An error sometimes occurred with core platform replicas.
-
Resolved [CVE-2024-7254]
-
Resolved [CVE-2024-5452] | Upgraded
pytorch-lightning
package to version 2.4. -
Resolved [CVE-2024-28122] | Upgraded package
jwx
to version 1.2.29. -
Security | Upgraded
/nats-server/v2
package to version 1.17.11.
Release 24.01.54
November 6, 2024
- Job count, which was previously removed from the Flow Review due to performance issues, has been reinstated, but you must enable it in Admin > Configuration
Release 24.01.53
October 31, 2024
-
Resolved [CVE-2022-1996] Updated
go-restful
to 2.16.0. -
Security | Upgraded
prometheus
to v2.55.0.
Release 24.01.52
October 30, 2024
-
Resolved [CVE-2020-26892] | Upgraded package
github.com/nats-io/jwt
to version 1.1.0 and packagegithub.com/nats-io/nats-server/v2
to version 2.1.9. -
Resolved [CVE-2024-41110]
-
Resolved [CVE-2020-26160] | Upgraded the
jwt-go
package to version 4.0.0-preview1. -
Resolved [CVE-2023-39631] | Updated the
numexpr
package to 2.8.5. -
Resolved [CVE-2019-25211] | Updated the
cors
package to v1.6.0. -
Resolved [CVE-2023-5752] | Updated the
pip
package to 24.3.1. -
Resolved [CVE-2024-6345] | Updated
pip
setuptools
. -
Resolved [CVE-2022-1996] Updated
go-restful
to 2.16.0. -
Security | Upgraded
prometheus
to v2.55.0.
Release 24.01.51
October 23, 2024
-
You can now use uppercase letters in your username, service account names, and space names.
-
Resolved [CVE-2024-31583] | Upgraded the
torch
package. -
Resolved [CVE-2024-6345] | Upgraded the
setuptools
package. -
Resolved [CVE-2024-4603] | Upgraded Redis and cryptography.
Release 24.01.50
October 16, 2024
-
Production performance was improved in cases where the Instabase clusters were affected by poor performance by the Redis deployment.
-
Rarely, reading a 0-length file with the Text Editor failed.
Release 24.01.49
October 9, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.48
October 9, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.47
October 3, 2024
- An issue in an third-party library caused breakages in an internal LLM.
Release 24.01.46
September 27, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.45
September 18, 2024
- Execution of flows with Reduce UDF steps in parallel branches encountered flow execution errors.
Release 24.01.44
September 12, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.43
September 5, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.42
August 29, 2024
-
For flows with checkpoints following an STP checkpoint, input records were incorrectly tracked, resulting in missing documents.
-
The site setting Restricted file upload extensions now also prevents downloading files with the specified extension name or renaming a file to that name.
Release 24.01.41
August 19, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.40
August 5, 2024
- When running a flow, the
out
folder in the selected output directory was not correctly cleaned up when thedelete_out_dir
was enabled.
Release 24.01.39
July 29, 2024
- You can now make a Solution Builder project’s Resources directory accessible with custom code.
Release 24.01.38
July 29, 2024
- Improved OCR text extraction and preprocessing of documents in ML Studio and the Reader app.
Release 24.01.37
July 19, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.36
July 17, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.35
July 10, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.34
July 8, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.33
June 28, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.32
June 27, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.31
June 27, 2024
-
If a reviewer opens a job assigned to a different reviewer, the flow review results now correctly show a warning banner for that job.
-
Encoded information from email headers, such as from, to, subject, and date, is now correctly extracted.
Release 24.01.30
June 20, 2024
-
In some cases, model service pods got stuck retrying on an entity detection model, blocking the model service from taking new requests.
-
The size of
.ibdoc
could grow too large if you performed file splitting in ML Studio with barcode detection enabled.
Release 24.01.29
June 12, 2024
- In flows with multiple split classifiers in the same path using the “Split PDF” option, generated PDFs were sometimes incorrectly split.
Release 24.01.28
June 7, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.27
June 6, 2024
-
PDF attachments weren’t assigned appropriate names during process files steps.
-
Job count, which was previously removed from the Flow Dashboard due to performance issues, has been reinstated and appears by default. You can disable job count at Admin > Configuration if you notice performance impacts in large-scale deployments.
Release 24.01.26
June 3, 2024
- The job service sometimes erroneously requeued completed tasks.
Release 24.01.25
May 29, 2024
This patch contains testing, optimizations, security, and other minor internal changes. User functionality is unchanged.
Release 24.01.24
May 21, 2024
-
Processing of large XLSX or XLS files can now be improved with preprocessing them, which removes all empty rows and columns. Preprocessing simplifies XLSX files and improves performance; however, it also removes all special text formatting, such as color or bold text. If text styling or images are critical to your results, do not preprocess your files.
To enable preprocessing, select Preprocess Excel files in your OCR configuration settings.
Release 24.01.23
May 15, 2024
- Flows did not generate results if one of the completed tasks didn’t have the flow_results field set.
Release 24.01.22
May 8, 2024
- When you made a change in the human review screen, the original value incorrectly showed as empty.
Release 24.01.21
May 1, 2024
- When resuming a flow during human review, it erroneously retried step errors.
Release 24.01.20
- Unused tags have been removed from the Flow Dashboard.
Release 24.01.19
- You can now include multiple apply classifier steps in flows. For example, you can use a first split classifier step to classify documents into general types, like identity documents and tax forms. Then, you can use a second split classifier step to subdivide these general types into more specific types, like passports and driver’s licenses.
Release 24.01.18
This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.
Release 24.01.17
This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.
Release 24.01.16
This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.
- The schema for the other class wasn’t applied to unclassified documents as intended.
Release 24.01.15
This version was not released.
Release 24.01.14
This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.
Release 24.01.13
This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.
Release 24.01.12
-
Source data is more accurately identified for extraction prompts due to improved OCR token matching.
-
When resuming a job from Flow Review, the status did not update if you stayed on the Flow Review screen.
Release 24.01.11
- From the Flow Review dashboard, you can now bulk assign jobs to a reviewer.
Release 24.01.10
- You can now visualize the progress of your flows on a file-by-file basis. Click into a file in your flow, and you can see any long-running steps that delay the completion of their flow.
Release 24.01.9
This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.
Release 24.01.8
- Annotations weren’t carried over when importing an annotation set from the file system into Solution Builder, or when exporting an ML Studio dataset in the same environment.
Release 24.01.7
This patch contains testing, optimizations, and other minor internal changes. User functionality is unchanged.
Release 24.01.6
- In ML Studio, the field list failed to load in certain circumstances.
Release 24.01.5
- ImageMagick height and width limit is increased from 16K to 20K. Additionally, new environment variables in
celery-app-tasks
control ImageMagick height, width, and area:MAGICK_HEIGHT_LIMIT
,MAGICK_WIDTH_LIMIT
,MAGICK_AREA_LIMIT
.
Release 24.01.4
This is the first generally available release of Instabase 24.01.
New features
Flow Review
-
Public Preview | Document-based review introduces a new method for managing human reviews where reviews are broken down into individual documents. This method provides more granularity in review assignment, and more flexibility for reviewers, because you can assign, review, or escalate individual documents, or open multiple documents from different flow jobs for review. Flows must be resumed as a separate step when all associated documents are marked reviewed.
Document-based reviews are supported only for flows that are managed as a deployed solution. Document-based reviews can’t be exported, and they’re not included in metrics reported in the Solution Dashboard.
Enhancements
Deployment Manager
-
You can now use the Update base config image API to override the defined registry. The change persists until a new base config is applied for the specified service, at which point the image uses the registry defined in the Installer.
-
From the Infra Dashboard, you can now view the live Kubernetes configuration for a given Kubernetes element. See the element’s Config tab for the materialized configuration and the Live K8s Config tab for the live configuration.
-
Network policies have been added for the jobs underlying the Test Runner test post-install action (
solution-diagnostic-artifacts
) and the base model image upload post-install action (mktplaceupdate
). Previously, if your security settings required network policies for all jobs, these actions wouldn’t run. mTLS traffic is not currently supported when running these jobs.
Observability
-
The legacy Prometheus service has been succeeded by the VictoriaMetrics
vmagent
service. Thevmagent
service is significantly better optimized, boasting up to a 7x improvement in memory optimization and approximately 2x improvement in CPU optimization, compared to Prometheus.-
The
vmagent
service supports horizontal scaling, allowing you to expand its capacity by adding additional instances. -
The
vmagent
service is deployed as a StatefulSet service. -
Rollback procedures are included as a contingency measure in the event of any issues.
-
-
VictoriaMetrics
vmalert
service now handles the evaluation of alerting rules. This service operates as a sidecar alongside VictoriaMetrics, enhancing the alerting capabilities of the system.
Flow
-
Improvements to the Flow job service provide faster startup for running flow jobs, reduced memory consumption, and increased queue capacity. Specifically, internal testing demonstrated a 5x improvement in startup time and the ability to queue up to 18x more jobs.
-
The Python library
pyxlsb
is now available for use in UDFs. This library enables importing.xlsb
files in a pre-flow UDF.
Flow Review
- You can now optionally display provenance only for the selected field by enabling Only Show Selected Field Annotations in field list options.
Flow Dashboard
-
Public Preview | Flow logs now include a visualization of processing time for each file and step in the job, so you can quickly identify slowdowns. You can drill in to each file to further visualize time taken by each step for a specific file. To access flow execution visualizations, from Flow Dashboard, click Logs for any job, then select the Visualize tab.
-
You can now view and filter by priority in Flow Dashboard. A new Priority column can be enabled in Flow Dashboard settings, and in the filter sidebar, you can filter by a specific priority. Flow priority can be set when using the API to run a flow.
-
The tag search filter in Flow Dashboard, as well as the
jobs/list
API endpoint, is now case sensitive.
Bug fixes
Flow
- License usage is more accurately counted for flows that include apply checkpoint steps. Previously, usage was counted only when a flow completed.
Flow Review
- When you used the pop-out icon to open a document in a separate window, the first file in the file list was displayed, rather than the selected document.
ML Studio
- If portions of a date were missing, the
format_date
function inserted random date values. For example, if the date was 11/2020, the function would generate a random date value and return 11/01/2020.
Deprecations and removals
See the deprecations and removals page for the latest announcements, as well as previous deprecations and removals.
Deployment guide
- Workload autoscaling was introduced as a public preview feature in release 23.07 and remains in public preview. As a public preview feature, workload autoscaling is disabled by default. Workload autoscaling has several infrastructure requirements. For instructions on enabling workload autoscaling during your upgrade, see the workload autoscaling feature documentation.