Release 3.3.0
Please follow the Upgrade Guide to update existing CluedIn installations.
Annotation
Fixes
- Supporting release for platform
- Move database table creation scripts into the runtime (AB#3533)
App
Fixes
- The environment variable
EMAILSETTINGS__ENABLESSL
can now be set to false
to disable ssl connections to smtp clients
- Adding roles to user now uses the legacy endpoints to allow for multi tenant roles
- Implement multi tenant users with matching emails (AB#5540)
Clean
Features
- Supporting release for platform
- Move database table creation scripts into the runtime (AB#3534)
Controller
Features
Added support for sendMail
CRD type to send simple on-off emails based on a configmap.
apiVersion: api.cluedin.com/v1
kind: SendMail
metadata:
name: 'sendmail-example'
spec:
emailSecret: "cluedin-email"
emailTemplateResource: "email-template"
apiVersion: v1
kind: ConfigMap
metadata:
name: email-template
data:
subject: "WELCOME to CLUEDIN!"
body: "Some talk about things.."
to: "stp@cluedin.com"
from: "no-reply@cluedin.com"
Datasource
Features
- Supporting release for platform
- Move database table creation scripts into the runtime (AB#3532)
Gql
Features
- Udpated to Node.js 16
- Add config for engine room charts (#11359)
- Move database table creation scripts into the runtime (AB#3532)
Fixes
- Engine room chart name (AB#7305)
Runtime
Docker Image |
Tags |
cluedin/neo4j |
3.3 , 3.3.0 , 3.3.0_35539 |
cluedin/neo4j-init |
3.3 , 3.3.0 , 3.3.0_35539 |
cluedin/openrefine |
3.3 , 3.3.0 , 3.3.0_35539 |
cluedin/sqlserver |
3.3 , 3.3.0 , 3.3.0_35539 |
cluedin/sqlserver-init |
3.3 , 3.3.0 , 3.3.0_35539 |
Features
- Added
sqlserver-init
container which will push the scripts to an already running instance of SQL Server.
- Parameters:
- MSSQL_HOST - SQL Server Host ( Default :
localhost
)
- MSSQL_PORT - SQL Server Host Port ( Default :
1433
)
- MSSQL_USER - SQL Server User ( Default :
sa
)
- MSSQL_PASSWORD - SQL Server Password ( Default:
yourStrong(!)Password
)
- MSSQL_DACPACS - SQL Scripts directory ( Default:
./dacpacs
)
- SQLCMD - SQLCMD exexcutable location ( Default:
sqlcmd
)
- SQLPACKAGE - SQLPACKAGE exexcutable location ( Default:
sqlpackage
)
- MSSQL_CLIENTUSER - Non-admin user ( Default: `` - Skips creation if blank)
- MSSQL_CLIENTUSER_PASSWORD - Non-admin user password ( Default: `` - Skips creation if blank)
- Added schema for Dynamic Vocabularies
- Extend processing rules to support an ingestion rule (AB#3916)
- Add table scripts for all micro services
- Add tables to track dataset endpoint ingressions
- Add upgrade notes
Server
Docker Image |
Tags |
cluedin/cluedin-server |
3.3 , 3.3-alpine , 3.3.0 , 3.3.0_35562 , 3.3.0_35562-alpine , 3.3.0-alpine |
cluedin/cluedin-server |
3.3-ubuntu , 3.3.0_31367-ubuntu , 3.3.0-ubuntu |
cluedin/nuget-installer |
3.3 , 3.3-alpine , 3.3.0 , 3.3.0_35562 , 3.3.0_35562-alpine , 3.3.0-alpine |
cluedin/nuget-installer |
3.3-ubuntu , 3.3.0_35562-ubuntu , 3.3.0-ubuntu |
Fixes
Configuration:
Key |
Default |
Notes |
DataStores.Graph.MoveRelationships.Limit |
10 |
How many relationships is moved within a single batch |
- Fixed
Neo4jEntityGraphDataStore.MoveRelationshipsByNodeId
runtime database error
- Change organization data shard caching to use 10 min sliding expiration instead of 1 min absolute expiration
- Add configuration keys to be able to configure organization data shard caching policy
Configuration:
Key |
Default |
Notes |
Caching.Organization.DataShard.SlidingExpirationSeconds |
600 |
Sliding expiration in seconds |
Caching.Organization.DataShard.AbsoluteExpirationSeconds |
|
Absolute expiration in seconds in the future from current date |
- Do not allow update of vocabulary name, if name already exists (AB#7435)
Features
- Added the ability to ‘Clean with history’ when creating a cleaning project.
- Added the ability to disabled the IDataContentExtractors in bulk using ClueProcessing.IDataPartProcessor.Enabled set to false. The PII/Sensitivity detection in CluedIn can be disabled through this. If you do not need to use this feature you will find that disabling it will increase performance in CluedIn.
- You can now force the Global Data Quality Metrics to run through a REST API call instead of waiting the 24 hours for the next job to run.
- All Glossary Searches will now search through the Long and Short Description as well.
- Added a new Search Endpoint to allow for paged term results
- Added a new EntityInfo Endpoint to allow for paged results
- Improved crawling throughput when lots of rules exist
- Normalized depedency tree to enable public nuget feed
- Added User Input Version Branch Evaluator
- Added new endpoint for adding a user to a role without sending the email confirmation
- Rule activation ordering starts from 2(AB#6797)
- Authentication certificate set from environment variables (AB#6011)
- Added ability to upgrade Enrichers to the new Provider Model (AB#7729)
Configuration:
Key |
Default |
Notes |
ClueProcessing.VersionHistory.BranchEvaluator.UserInput.Enabled |
True |
Turn this feature on or off |
ClueProcessing.VersionHistory.BranchEvaluator.UserInput.ExcludedInputSources |
|
Specify semicolon separated data part input sources to ignore |
ClueProcessing.VersionHistory.BranchEvaluator.UserInput.ExcludedEntityTypes |
|
Specify semicolon separated entity types to ignore |
- Added configuration keys to be able to control how merge entities user action should persist the user preserved values
Configuration:
Key |
Default |
Notes |
UserActions.MergeEntities.ApplyTargetEntity.Properties |
True |
Apply all property no matter if they changed or not |
UserActions.MergeEntities.ApplyTargetEntity.Codes |
True |
Set all entity codes of the target entity instead of only the origin entity code |
UserActions.MergeEntities.ApplyLastChangedBy |
True |
Set last changed by if available |
UserActions.MergeEntities.ApplyChangeVerb |
False |
Apply change verb |
UserActions.MergeEntities.ChangeVerb |
Merged |
The change verb to apply |
- Added the ability to configure metrics settings through the API. This gets stored in the DB but if a configuration file has a different value to the default value that value will override what is stored in the DB.
- Added Enricher Providers (AB#3531)
- Added a new EntityInfo Endpoint to allow for paged results
- Extend processing rules to support an ingestion rule (AB#3916)
- When uploading a file which is mapped to a “new” vocabulary key, the value does not work with the “TERM” search (AB#6108)
- Create an endpoint that returns if Vocabulary keys exists or not (AB#5258)
Submitter
UI
Docker Image |
Tags |
cluedin/ui |
3.3 , 3.3.0 , 3.3.0_35541 |
Features
- Mapping screens in data sources has been redesigned to offer better visibility on the Mapping of the orignal fields of the data set.
- Auto-mapping will now be able to generate the Vocabulary and Vocabulary keys for you.
- You are now able to create Vocabulary keys during the mapping phase of your Data Set.
- New Data Catalog section under Management, you can know view a list of your Vocabularies and Vocabulary key.
- The list of vocabulary supports filtering by Entity type, Integration and search by name.
- The list of vocabulary keys supports filtering by type, by integrations, by classification and search by name or key.
- Core and custom Vocabularies can now be edited.
- Core and custom Vocabulary Keys can now be edited.
- You can create your own Vocabulary.
- You can create your own Vocabulary Key.
- You can edit Voabularies.
- You can edit Vocabulary keys.
- New Usage Tab under a Vocabulary page to see where the vocabulary is being used.
- Owners Tab under a Vocabulary page to define who is the owner of the vocabulary.
- New Usage Tab under a Vocabulary Key page to see where the vocabulary key is being used.
- All values tab under a Voabulary Key page to see what are the differen values for a given Vocabulary key.
- Lineage Tab under a Vocabulary Key page to see the mapping from source to stream and how the data is flowing.
- You can now add (or change) the mapping of a Vocabulary key.
- You can now rename a vocabulary key.
- A new section called Entity Types has been added in the Management pillar.
- You can now see the list of all Entity types in the application and search them by name or code.
- You configure the Entity Type configuration (icon, displayName…) from multiple places.
- You can now create your own Entity Type
- Under the Entity type page, you have a Data tab showing all the entities that has the Entity type as type.
- Under the Entity type page, you have now a Vocabularies tab to see all the vocabularies are attached to an Entity type.
- Rules now support multiple actions.
- Rules now support sub filters for each action.
- New settings to enable/disable metrics (global or individual) has been added.
- A new pillar called ‘Engine Room’ has been added to the main menu.
- In the Engine Room, you can follow the status of the different main processing stage of the data pipeline.
- In the Engine Room, you can now see high level statistics on the Cluster health.
- In the Engine Room, you can now drill down to some specific services to see basic statistic (Search Db, Relational Db, Graph Db…)
- In the Engine Room, you can now see how the CluedIn instance is being setup in a configuraiton point of view.
- In the Entity page, you have now a tab called “Explaing Log” which helps you debug/understand how the Golden record is being produced.
- Enricher can now be added and edited, if the Enricher supports configuration (like API Token), you will be able to enter them from the UI (it was in config file before).
- Enricher supports permissions the same way as Integrations and Data Sources.
- The UI now has the ability to run in multiple languages.
Fixes
- In duplicates section, only call the Queries endpoint to have faster result on what duplicate queries are installed.
- You can now map multiple Edges for the same property.
- The reports of the processing have been improved to be more accurate (total and progression).