Cross-instance Padif analyses

The major novelty in this release is the ability to run Padif analyses that include documents not only from your own instance but also from other ProFID instances.

This feature will allow users to explore possibly related cases in other countries, empowering them to further their investigations by reaching out to their colleagues.

Disclosed data

First, note that the feature discloses only the strict minimum of metadata about documents and series to users of other instances:

  • The document's sequential identifier, containing the document's creation year and a sequential number.
  • The identifier and name of any series the documents belong to.
  • The name and email of document creators and series managers. Those are disclosed so that users can contact the right person in case a document or a series is of particular interest.

In particular, users don't have access to any images related to documents from other instances. Any computation required to calculate the similarity between documents is done privately on the server.

Performing a cross-instance analysis

Prerequisite

Not all Padif analyses can be configured with external instances.

The document set of that analysis must be dynamic, that is, rely on filters rather than hand-picked documents. It also must not have labels, series or services as filters, as those are instance-specific features of a document.
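
The prerequisite can be pictured as a small eligibility check. The sketch below is only an illustration: the `DocumentSet` model, the filter names and the function `cross_instance_blockers` are assumptions made for the example, not ProFID's actual data model.

```python
from dataclasses import dataclass, field

# Filter kinds described above as instance-specific (hypothetical identifiers).
INSTANCE_SPECIFIC_FILTERS = {"labels", "series", "services"}


@dataclass
class DocumentSet:
    """Hypothetical, simplified model of a document set."""
    kind: str  # "dynamic" or "manual"
    filters: dict = field(default_factory=dict)


def cross_instance_blockers(doc_set: DocumentSet) -> list[str]:
    """Return the reasons why a set cannot be used across instances."""
    reasons = []
    if doc_set.kind != "dynamic":
        reasons.append("document set is not dynamic")
    # Any instance-specific filter in use disqualifies the set.
    for name in sorted(INSTANCE_SPECIFIC_FILTERS.intersection(doc_set.filters)):
        reasons.append(f"filter '{name}' is instance-specific")
    return reasons
```

An empty list means the analysis could be configured with external instances; otherwise each entry explains what would need to change first.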

To start a cross-instance analysis, go to an existing Padif analysis and select the "External instances" action from the dropdown menu.

Dropdown menu with the new option for configuring external instances

Then select from the list of available instances the one you want to include in the analysis. The number in parentheses shows you the number of documents that match the document set filters of the analysis in the other instance.

The external instance configuration dialog.

Once saved, expect to wait 30 minutes or more before you see the result of the cross-instance analysis. The delay depends on when the next run is scheduled, how many other analyses are in the queue, and how many documents and regions of interest the analysis has to process.

For each external instance selected, you will see a new tab containing the usual Padif dendrogram, but with documents from the other instance listed, as well as their associated series.

Report including documents coming from one's own instance and an external instance.

Opting out

By default, all instances except for ProFID series (which does not hold documents, only series) can be included in cross-instance Padif analyses.

It is, however, possible to opt out of sharing by emailing us at support+profid@zakodium.com.

Opting out prevents other instances from using your data, and also prevents you from using the data of any other instance.

Other Padif improvements

New ROI editing interface

The page to edit regions of interest (ROIs) for Padif analyses has been completely redesigned.

You can now work on all the ROIs at once instead of removing and adding them one by one. It works similarly to the page for editing anonymized regions when importing scans.

You can also choose the color of the ROIs, which was not possible before.

The new ROI editing interface.

Status information

We added context on the state of a Padif analysis, so that you are aware when things are missing or in an errored state in the report:

  • The number of documents included in the report.
  • The time at which the last execution happened, or whether the analysis is currently running.
  • The list of documents which are part of the analysis but aren't in the report, along with the reason why.
  • The number of regions of interest which are configured in the analysis but are not yet part of the report.

The new status information in the Padif report.

Performance improvements

The Padif analyses are computationally expensive, and it can take some time to generate the report.

We have audited the performance bottlenecks and made some improvements to speed up the process.

Security improvements

Further minor security concerns identified in the second security audit have been fixed. Notably, passwords must now be at least 10 characters long, and users get feedback on the strength of the password they choose.

Password strength feedback.

Bug fixes and minor improvements

Minor improvements

  • The activation / deactivation of a Padif analysis now has to be confirmed to make the operation safer.
  • We facilitated bug reports by providing a direct mailto link in error alerts.
  • For admins, we added the ability to filter users by role.
  • The scan import process was made more robust, handling the presence of a white crease at the top of the canvas.
  • In the document suggestions tab, if the document's serial number matches that of other documents in the database, we now show basic information about those documents and a link to them.

The document suggestions tab showing a document with a similar serial number.

Bug fixes

  • We fixed an error which prevented labels from being renamed.
  • ROIs which exist in a Padif analysis but are not yet in the report can no longer be selected in the report.
  • We prevented the deletion of a document if it is a reference document of a Padif analysis.
  • When viewing a document, the suggestion tab lists the Padif analyses in which the document is involved. We fixed a bug which made the list empty even when there were linked Padif analyses.

Sharing series

It is now possible to share series with other ProFID instances, by securely exporting them and letting the recipient import them into their own instance.

Exporting a series

The new "Share series" menu item in the series page

When sharing a series, you will be invited to enter a password to protect the file, and to choose which data you want to include in the export:

  • Series images
  • Series attachments
  • Last published alert

General information like the name of the series, as well as the series material and contextual profiles, are always included in the export.

info

The export will not contain any information about the documents linked to the series, except for the number of documents linked and the span of seizure dates of those documents at the time of the export.

Importing a series

With a .pfd file in hand, the recipient can go to "Series > Import series" in the top menu, and upload the file.

Provided that the password is known to the recipient, they will be able to import the series into their instance.

Preview of the series before importing it.

The identifiers created for imported series are generated in the same way as for regular series: they contain the code of the instance into which the series is imported, the year and an incremental number. However, on the series' page, you will notice a reference to the exported series.

Imported series have the same permission rules as any other series, and they can be edited afterwards or have documents linked to them.

Series import / export is the first step towards a more open and interconnected ProFID. We hope that it will foster collaboration between countries and allow for more links to be established, helping the fight against organized crime.

New Padif

The new Padif, the automated document comparison algorithm to help ProFID users attribute documents to series, is now out of experimental phase and generally available to all users. The old version of Padif has been removed.

Padif rewrite

Padif was originally written in Python by Sydney Barthe, then a student at the Ecole des sciences criminelles, University of Lausanne. It had not evolved since it was first integrated into ProFID, because its technology stack is very different from that of the rest of ProFID, and some known bugs were never fixed.

The new Padif, entirely rewritten in JavaScript, is more tightly integrated with the rest of the codebase and will give the development team greater autonomy to fix bugs and evolve the algorithm. It improves some key parts of the process. For example, the algorithm for aligning regions of interest between documents has been rethought and improved.

Portion of a dendrogram generated by the new Padif algorithm

New Padif features

The new Padif allows regions of interest to be included in or excluded from the results without re-running the comparison. This makes the process of finding the right regions of interest for the analysis less tedious. It also lets you choose between different distance aggregation algorithms for creating the dendrogram.
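
One way to picture why regions of interest can be toggled without re-running the comparison: if pairwise distances are kept per ROI, only the final aggregation step has to be redone. The sketch below rests on that assumption. The data layout, the function `aggregate_distances` and the aggregator names (`mean`, `max`, `min`) are hypothetical, and the release notes do not say which algorithms Padif actually offers.

```python
from statistics import mean

# Hypothetical aggregation choices; the real options are not named in the notes.
AGGREGATORS = {"mean": mean, "max": max, "min": min}


def aggregate_distances(per_roi, included_rois, method="mean"):
    """Combine cached per-ROI distances into one distance per document pair.

    per_roi maps an ROI name to {(doc_a, doc_b): distance}.
    Only the ROIs listed in included_rois contribute to the result.
    """
    combine = AGGREGATORS[method]
    selected = [per_roi[name] for name in included_rois]
    if not selected:
        raise ValueError("at least one ROI must be included")
    # Keep only the pairs that were measured in every selected ROI.
    pairs = set(selected[0]).intersection(*selected[1:])
    return {pair: combine([roi[pair] for roi in selected]) for pair in pairs}
```

Excluding an ROI then amounts to calling the function again with a shorter `included_rois` list; the expensive image comparison that filled `per_roi` is not repeated.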

The new Padif advanced settings

Security improvements

A second security audit was conducted on the ProFID software.

Various minor security concerns have been identified and fixed to make ProFID even more secure.

Bug fixes

  • Fix incoherent filter indicators in scan list page.
  • Improve guiding borders around document QR codes to ensure they are cut correctly.
  • Make sure only admin or editors have access to the list of ProFID users.

Notable changes

New workflow to import scans

info

This is an overview of the new import workflow. For more in-depth information, refer to our new tutorial video as well as to the new documentation on how to add anonymized regions manually.

This workflow is split into 4 steps:

  1. Extraction
    Finds the borders of the document and extrapolates the best coordinates and angle to crop the document.
  2. Rotation
    Using a machine learning model, re-orients the document so that it is upright.
  3. MRZ extraction
    Using machine learning models, finds the MRZ and reads its characters.
  4. Anonymization
    Using machine learning models, finds and prepares the anonymization of faces, signatures, barcodes and MRZ text on the document.

Each step can be manually corrected if needed.

If an MRZ is found, the user must validate that it was correctly read.

caution

The original scan is now deleted once the import workflow is finalized and the document is imported.

QoL improvements

  • Document sets can now be deleted if they are not associated with a Padif analysis.

  • Display creator and editor details (date, time, mail link) on Scan, Document set, Label, Padif analysis, and Series.

  • The web page title (in the browser tab) now updates as you navigate, which makes it easier to tell multiple open ProFID tabs apart.

    Three browser tabs with precise titles

    Example of updated tab titles.
  • Improved the scan details page.

  • Label descriptions are now optional.

Bug fixes

General

  • Fixed a bug that prevented documents from Kosovo from being edited. The Kosovo country code is now correctly set to XKK.
  • When an unexpected error happens, pages with tabular data will now show the error instead of the previous data.

Document

  • On the standardized image editing page for documents, button text no longer wraps, which improves readability.
  • Labels filter displays correct associated documents count.

Admin

  • In users management pages, replace wrong links in breadcrumb from /users to /admin/users.

Notable changes

The page layout has also been improved and made more consistent.

Navigation breadcrumb for document edit page

Breadcrumb segments link to the parent page.

Improve ROI creation in Padif analysis

The ROI creation module has been completely revamped. It now includes a preview of existing ROIs and a usage guide.

New page for ROI creation

Pages with tabular data come with search, sort and filters

The ability to search, sort, and filter has been added to the following tables:

  • Scans
  • Padif analyses
  • Users
  • Services
  • Seizure authorities

Security fixes

ProFID has been audited by an independent security company. This release includes various fixes and best practices to improve the security of the application.

One of these changes might impact users: the allowed file types for uploads are now restricted.

  • For scan uploads and document extra images: PNG, TIFF, and JPEG.
  • For series attachments: PNG, JPEG, BMP, PDF, and TXT.
  • For series alert image uploads: PNG and JPEG.
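
As an illustration, the restrictions above can be written as a lookup table. This is only a sketch: the upload-kind keys, the extension spellings and the function `is_upload_allowed` are assumptions, and a real server would normally inspect the file content rather than trust the file name.

```python
from pathlib import PurePath

# Allowed formats per upload kind, as listed above. The keys and the
# extension spellings are assumptions made for this sketch.
_IMAGES = {".png", ".tif", ".tiff", ".jpg", ".jpeg"}
ALLOWED_EXTENSIONS = {
    "scan": _IMAGES,
    "document_extra_image": _IMAGES,
    "series_attachment": {".png", ".jpg", ".jpeg", ".bmp", ".pdf", ".txt"},
    "series_alert_image": {".png", ".jpg", ".jpeg"},
}


def is_upload_allowed(kind: str, filename: str) -> bool:
    """Check the file extension (case-insensitively) against the table."""
    return PurePath(filename).suffix.lower() in ALLOWED_EXTENSIONS[kind]
```

For example, `is_upload_allowed("series_alert_image", "overview.bmp")` is false under these assumptions, because BMP is only accepted for series attachments.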

Other small improvements

If you encounter any unexpected error in the application, it will show you a unique request identifier. Please include this identifier if you make a support request.

Manual and dynamic document sets

Manual document sets have long existed in ProFID, but lacked some key features and were not very easy to construct. This release significantly improves the usability of setting up document sets, and introduces dynamic document sets, a new type of document set which automatically updates the list of documents based on user-chosen filters. This opens up new use cases for Padif analyses, where new documents can be compared to other documents in the set as soon as they are added.

List of document sets

The list of document sets is now presented in a table, with search, sort and filter features.

List of document sets

Manual document sets

Manual document sets used to be hard to construct because:

  • The metadata of documents was not accessible from the document set interface.
  • The list of documents was not searchable or sortable.
  • It was not possible to exclude documents, which made them appear among valid candidates.

All those issues have been addressed in this release. Document set members are now displayed in a table with metadata, candidates can be searched or sorted, and documents can be excluded from the candidate list. Candidates are managed in a large slide-over panel. On reasonably large screens, this lets you see the documents already in the set and the candidates at the same time.

Excluded documents can still be added back to the candidate list if a mistake was made.

Managing the document set's document list

Managing candidates

Dynamic document sets

Dynamic document sets are a brand-new feature which lets users populate document sets without having to manually add each document. The principle is simple: you select a set of filters, similarly to how you would search for documents, and the set is automatically populated with the documents matching the filters.

Similarly to manual sets, it is possible to exclude documents from the set. Doing so will prevent the document from being part of the set, even though it matches the filters.
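
The behaviour described above (filters decide membership, exclusions override the filters) can be sketched in a few lines. The document fields, the exact-match filter semantics and the function `resolve_dynamic_set` are assumptions for illustration only.

```python
def resolve_dynamic_set(documents, filters, excluded_ids=frozenset()):
    """Return the ids of documents matching every filter, minus exclusions.

    documents is a list of dicts with an "id" key; filters maps a field
    name to the value it must equal (a simplification of real filters).
    """
    return [
        doc["id"]
        for doc in documents
        if doc["id"] not in excluded_ids
        and all(doc.get(name) == value for name, value in filters.items())
    ]
```

Because membership is recomputed from the filters, a newly added document that matches them appears in the set without any manual step, unless it has been excluded.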

As mentioned in the previous release, there are valid use cases for using labels as filters for dynamic document sets, and this is now possible.

Documents of a dynamic set are displayed in the same way as those of manual sets.

Dynamic document set view

The filters of a document set can still be edited after the set has been created.

Dynamic document set managing filters

Other document set improvements

We also added a couple of other improvements to document sets:

  • Document sets can be renamed
  • A document can be added to a manual set directly from the document's page.

Padif

The selection of the reference document has been improved. The images are no longer obstructed by labels and additional information about the document is displayed.

Selecting the reference document

Documents

  • It is now possible to assign / remove labels to / from multiple documents in one action from the document list page.
  • The search input in the document list page is now split into terms, each of which is looked up in all relevant fields. The result contains the items which match all terms.
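
The "all terms must match" behaviour can be sketched as follows. Case-insensitive substring matching, whitespace splitting and the function name `matches_all_terms` are assumptions; the release notes only state that the input is split into terms and that results match all of them.

```python
def matches_all_terms(query, record, fields):
    """True if every search term occurs in at least one of the given fields."""
    terms = query.lower().split()
    values = [str(record.get(name, "")).lower() for name in fields]
    # Each term may match a different field, but no term may be missing.
    return all(any(term in value for value in values) for term in terms)
```

With this logic, a query such as `scepter germany` matches a record whose operation contains "Scepter" and whose country is "Germany", even though no single field contains both words.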

Series

As for the document list page, the search input in the series list page is now split into terms, each of which is looked up in all relevant fields. The result contains the items which match all terms.

Alert

A new toolbar button lets you select predefined sentences in the "Measures" field.

Other small improvements

  • Display a warning when the user attempts to change the scope of a label.
  • Invalidate user session when an admin resets the user's password.
  • Padif analyses can now be deleted.

Bug fixes

  • Add and fix translations on series alerts.
  • Fixed a crash in the alert edit page which occurred in some circumstances when the edited language was changed.
  • Fixed an error in the rapid identification image editing modal on the alert page, which occurred when the form was submitted without setting the image.
  • Publishing an alert now changes the edition date of the series.
  • Fixed invalid values on home stats page when no documents were assigned to any series.
  • Allow removing the service from a user in the admin user list page.
  • Fixed the behaviour when clicking on a label. The list of documents is now filtered on this label as expected.
  • Fixed missing count for the unspecified option in filters.
  • Fixed the download button on the scan details page. It was downloading a lighter version instead of the original one.

ProFID series alerts

This is the biggest addition to this release. There is a dedicated article on series alerts.

Document labels

A label is an arbitrary keyword that can be associated with any number of documents. This is useful for further characterizing documents with attributes which do not exist as fields in the document. One key advantage of labels is that they are searchable from the document list page.

Before you start creating labels, it is important to consult the other users of your ProFID instance and agree on a common set of labels. This will ensure that labels are used consistently and that they are useful to everyone.

Labels can be used for a variety of purposes. For example, you can use them to organize work: a non-expert might add the "requires analysis" label to documents which need input from a document analysis expert. The label can then be used by experts to filter the document list page to only show documents which require analysis.

Furthermore, labels can be scoped. Scoped labels are special labels which have a constraint on the document type, country, and/or fraud type to which they can be applied. One potential application of scoped labels is to categorize documents based on their comparability in terms of automated image processing. For example, one might create a scoped label that only applies to Swiss IDs and differentiates the "2023 version" of the document from the older ones which have a radically different design. In future versions of ProFID, categorising documents this way will enable the use of "smart" document sets, which will automatically update based on those labels or other filtering criteria. The update of document sets will automatically trigger Padif analyses to be re-run.

Rich text editor

Until today, it was (almost) not possible to format text in ProFID. This is now officially possible thanks to the new rich text editor. It allows a wide range of possibilities to format text: headers, text alignment and colours, bullet points, etc. It is available in the following places:

  • The different alert fields: general information, rapid identification element and measures.
  • The series material and contextual profile fields.

The previous version of ProFID allowed the use of a syntax called "markdown" to format text in the series material and contextual profile fields. This was an undocumented feature but was used by some users. Those fields have been migrated to the new rich text editor for a seamless transition.

Rich text editor

Example of the rich text field

New dedicated pages for images and attachments

ProFID now features a new dedicated editing interface for standardized and extra images on documents, as well as images and attachments on series. This new user-friendly interface creates a clearer delimitation between view and edit pages.

Prevent lost work

ProFID has forms to fill data about documents and series. It is not uncommon to spend time filling a form only to accidentally lose everything by navigating to another page or by closing the browser's tab. ProFID will now warn you if you are about to lose your changes.

Risky navigation modal

You will be asked for confirmation during risky navigations

Improved session restoration

For security, ProFID logs you out after 30 minutes of inactivity. An overlay inviting you to log in again appears on the page when that happens. When multiple tabs are open, you had to log in again in each tab. This is no longer the case: you will only have to log in once and all tabs will be automatically restored!

Improved document and series lists

We show more fields in the document list (document number, operation) and on series (aliases). Before this addition, some fields were searchable but not displayed in the results, creating confusion.

New types of light for extra images

We added 2 new types of light for extra images, in alignment with the FADO terminology:

  • Coaxial
  • Oblique

Other new pages

  • We added a new scan details page available from the scan list.
  • We added a standardized image details page available from a document's details page. Hover over the image to access the link.

Bug fixes

  • Fixed a bug which allowed multiple users with the same email address to be created, if they were written with different casing.
  • Fixed a bug where a document's date of addition to a series was erroneously updated when non-series fields were changed.

ProFID series alerts

ProFID series alerts are a long-awaited feature which we hope will bring great operational value to the project.

What are series alerts?

Series alerts are PDF documents generated through ProFID which are meant to be disseminated to draw attention to an active series. The goal of such alerts is manifold:

  • Inform that a series has been detected by a law enforcement agency and that other agencies may also be concerned
  • Let people know about the ongoing investigative measures, the need to collect and share information related to the series and its underlying organised crime group.
  • Improve detection of documents, which helps first and second line officers fight against document fraud and the organised crime group related to that series.
  • Provide guidance to officers on the behaviour to adopt towards the document holder when facing a document belonging to a series.
  • Facilitate the gathering of critically important intelligence in the context of an investigation related to a series.

How to create a series alert?

Series alerts are defined through a set of 4 fields:

  • General information: This field is meant to provide general context around the series, focusing on what is relevant to the alert. It can be for example a sum-up of the investigative results to date or the profile and routing of the holders of these documents.
  • Rapid identification image: An image which illustrates what the specific features of this series are and where they are located on the document, usually based on a typical example of a document belonging to the series.
  • Rapid identification element: A list of check points to identify the document as belonging to the series. It should refer to the "rapid identification image" for context. The most specific and obvious elements should be listed first.
  • Measures: What one is expected to do when they identify a document belonging to the series. For instance, it can be to inform agent X from law enforcement agency Z that a new case has been detected, to search the smartphone of the holder of the document to find information on where and from whom it was bought, or how first and second-line officers should behave when facing a document belonging to the series (such as "Consider the holder as a possible terrorist").

Alerts can be translated into multiple languages.

Rapid identification image

Example of a rapid identification image for the "BROKEN-SCEPTER" series.

Click here to view a full example of an alert.

The series alert lifecycle

Alerts go through the following lifecycle:

  • Not published (draft): The alert is created and being edited. It is waiting to be published for everybody to see.
  • Published: The alert is published and can be viewed by everybody.
  • Published with draft: The alert is still published but a new draft revision is being edited.
  • Published: The new revision is published and can be viewed by everybody.
  • Archived: The alert is no longer relevant and is hidden from users.

A series can only have one non-archived alert at any given time. Once an alert is archived, a new alert can be created for the series.
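
The lifecycle above can be read as a small state machine. The sketch below encodes only the transitions listed in the text. The action names (`publish`, `start_revision`, `archive`) are hypothetical, and whether a draft or a "published with draft" alert can be archived directly is not stated, so those transitions are left out.

```python
from enum import Enum


class AlertState(Enum):
    DRAFT = "not published (draft)"
    PUBLISHED = "published"
    PUBLISHED_WITH_DRAFT = "published with draft"
    ARCHIVED = "archived"


# (current state, action) -> next state. Action names are hypothetical.
TRANSITIONS = {
    (AlertState.DRAFT, "publish"): AlertState.PUBLISHED,
    (AlertState.PUBLISHED, "start_revision"): AlertState.PUBLISHED_WITH_DRAFT,
    (AlertState.PUBLISHED_WITH_DRAFT, "publish"): AlertState.PUBLISHED,
    (AlertState.PUBLISHED, "archive"): AlertState.ARCHIVED,
}


def apply_action(state, action):
    """Return the next state, or raise if the transition is not listed."""
    try:
        return TRANSITIONS[(state, action)]
    except KeyError:
        raise ValueError(f"cannot {action} an alert that is {state.value}") from None


def can_create_alert(existing_states):
    """A series may hold only one non-archived alert at any given time."""
    return all(state is AlertState.ARCHIVED for state in existing_states)
```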

Who can create series alerts?

Any user with the editor role can create and edit any alert. Users with the viewer role can only view published alerts.

Sequential ids for documents and series

From now on, sequential identifiers are automatically assigned to documents and series. They contain the character "D" or "S", the year of creation and a number that starts at 1 and is reset every year.

Additionally, the identifier for series contains a three-letter code for the ProFID instance that created it (usually a country code). This allows series identifiers to be shared between ProFID instances without ambiguity.

  • Example of a document identifier: D-23-0001
  • Example of a series identifier: S-DEU-23-001
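
To make the format concrete, here is a minimal parsing sketch based solely on the two examples above. The two-digit year, the uppercase three-letter instance code and the names `SequentialId` and `parse_identifier` are assumptions; the width of the numeric part is deliberately not enforced because the examples differ.

```python
import re
from typing import NamedTuple, Optional


class SequentialId(NamedTuple):
    kind: str                 # "document" or "series"
    instance: Optional[str]   # three-letter instance code, series only
    year: int                 # two-digit year of creation
    number: int               # counter that restarts at 1 every year


# Patterns inferred from the examples D-23-0001 and S-DEU-23-001.
DOCUMENT_ID = re.compile(r"D-(\d{2})-(\d+)")
SERIES_ID = re.compile(r"S-([A-Z]{3})-(\d{2})-(\d+)")


def parse_identifier(text: str) -> SequentialId:
    match = DOCUMENT_ID.fullmatch(text)
    if match:
        return SequentialId("document", None, int(match[1]), int(match[2]))
    match = SERIES_ID.fullmatch(text)
    if match:
        return SequentialId("series", match[1], int(match[2]), int(match[3]))
    raise ValueError(f"not a recognised identifier: {text!r}")
```

Under these assumptions, `parse_identifier("S-DEU-23-001")` yields a series created by the `DEU` instance, in year 23, with number 1.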

Other changes

  • Series are no longer created while describing a document. If a document is associated with a series that doesn't exist yet, the series will be created when the document is saved.
  • In the series list, the "Updated at" column no longer reflects all changes to a series. This field is now only updated when the series' list of documents is modified or when its contextual or material profile is updated.
  • In documents and series search filters, clicking the "Reset" button now also resets the text search term.
  • Various bug fixes and improvements to the user interface.

In this release, we have added the possibility to rename series and made a few improvements to the user interface.

Renaming series

It is now possible to change the name of a series:

Rename series button

When a series is renamed, an alias is automatically created with the old name. It can be removed afterwards by editing the series.

A series can have multiple aliases. They are displayed on the series' page, and search boxes now also search within aliases in addition to series names.
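
The renaming behaviour can be sketched with a small model. The `Series` class and its methods are hypothetical and only mirror what is described above: the old name becomes an alias, and searches look at both the name and the aliases.

```python
from dataclasses import dataclass, field


@dataclass
class Series:
    """Hypothetical, simplified model of a series with aliases."""
    name: str
    aliases: list = field(default_factory=list)

    def rename(self, new_name: str) -> None:
        """Change the name and keep the old one as an alias."""
        if new_name == self.name:
            return
        if self.name not in self.aliases:
            self.aliases.append(self.name)
        self.name = new_name

    def matches(self, query: str) -> bool:
        """Case-insensitive search over the name and all aliases."""
        needle = query.lower()
        return any(needle in text.lower() for text in [self.name, *self.aliases])
```

After `Series("BROKEN-SCEPTER").rename("BROKEN-CROWN")`, a search for "scepter" still finds the series through its alias.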

User interface improvements

  • For both series and documents, the main actions you can do are now all located on the top-right of the page:

    Document and series actions

  • We also changed the way standardized document images can be rotated or deleted. The buttons now appear when going over the image with the mouse.

Bug fixes

  • Fixed a bug which prevented forms from being submitted again after a validation error. It is no longer necessary to refresh the page to be able to submit the form again.

With ProFID becoming more and more widely used as part of its expansion to many EU countries, it's essential for users, especially when onboarding, to have reference documentation that is up to date and on which they can rely. We are proud to announce a major change to ProFID's reference documentation.

A brief history

Before we dive into it, let's briefly recap how the documentation evolved since it first launched.

  • Originally, back in 2017, it was a Word file in French, shared with users but not available on the website. At that time, only a few users in Switzerland were familiar with the system, so this was manageable.
  • Around mid-2019, as the system gained momentum and started to onboard more users in a multilingual Switzerland and abroad, the documentation was translated into English.
  • In December 2020, the documentation was made available to users directly from the website as an embedded PDF file. Anyone with an account could now access this precious knowledge base!

Today we are hitting a new milestone. The documentation has been migrated to fully take advantage of the web, unlocking useful functionalities and providing a better user experience. We also filled gaps that existed in the previous documentation.

New features of the revamped documentation

New guides

ProFID has evolved quite a bit since the documentation was originally written. As a consequence, some of the features of ProFID were not documented at all. In this new version, we filled the gaps:

Documents in ProFID can now be edited and deleted, not just added. These features, which are indispensable for a good user experience, are now properly explained and documented.

The documentation about images has also been reworked and improved. Working with and deleting scanned images is now explained and illustrated. The new feature of adding extra images (images that do not require scanning) and working with them is documented too. Extra images will improve the ProFID database by making it more detailed.

To the existing documentation about series creation, we also added illustrated explanations of series merging and deletion.

  • Administrator's rights

To avoid confusion, we documented the differences between the capabilities of the different kinds of users. There is now a special section dedicated to the administration of ProFID and its extended features, such as setting up the system for the first time or managing users.

Added depth

Our new documentation explains certain aspects more thoroughly and in greater detail. Moreover, several parts of the documentation now include step-by-step instructions that will, hopefully, give users a better understanding of the concepts and their applications.

Better structure

The documentation is now divided into major sections, which are themselves split into separate pages, each explaining one aspect of working with ProFID. This makes it possible to navigate and search the docs more efficiently.

Search feature

The new site includes a search bar that lets you navigate to the appropriate documentation page based on search terms.

Try it out!

You can access the User guide and the Videos from the top menu bar. The Updates page is also available from there and will be updated when new features or other changes are deployed to ProFID.

Feedback

We hope that those changes are to your liking. If you wish to comment or provide feedback on the new documentation, feel free to contact us at support+profid@zakodium.com.

info

As part of this change, we are also removing the French translation of the documentation. From now on, the documentation is only available in English. With a constantly evolving tool, maintaining documentation in multiple languages is challenging and requires a lot of effort, so it was decided to prioritize the development of new features over the internationalization of the documentation.