Introducing: Confidence Scores

By  Markus |
Introducing: Confidence Scores

Not every duplicate is problematic: experts differentiate between appropriate and inappropriate cases. Appropriate duplicates are not research integrity issues, while inappropriate ones are integrity issues and can lead to a paper’s rejection, correction, or retraction. With our latest update, we display confidence scores from 0% to 100% for each detected duplicate. A high confidence (e.g., 99% in the example above) means that the finding is likely inappropriate. In contrast, a low score (e.g., 2% in the “merge” duplicate example) means the case is likely appropriate. The confidence score helps you to quickly separate between relevant and irrelevant cases.

New user interface

In the updated user interface, each duplicate is accompanied by a confidence score. A slider can be used to filter duplicates based on a specific confidence threshold. We differentiate between three ranges: low (0%-32%), fair (33%-65%), and high (66%-100%). By default, the slider starts at 33%, displaying fair and high cases. We recommend reviewing all fair and high cases. To deepen your investigation, you can examine duplicates in the low range by adjusting the slider. Based on the current filters, we summarize how many findings are currently shown/hidden. All currently shown findings are included in the PDF report.

Appropriate vs. inappropriate

We compute the confidence/relevancy of a duplicate using several features. Features include image similarity, image class (e.g., radiology, spot image, or western blot), duplicate type (e.g., duplicate inside the same paper or across papers), and many others. Based on these features, we derive a confidence score for each duplicate, representing the appropriateness of a duplicate. We measured the effectiveness of our algorithm on inappropriate duplicates posted on PubPeer mixed with appropriate ones from randomly sampled publications.

Appropriate examples are versatile, such as two images of the same microscopy image with different zoom factors, images with different color channel overlays (merge), or radiology images showing a brain scan with different color injections. We analyzed 5068 duplicates, consisting of 1797 inappropriate and 3271 appropriate duplicates. Of the 1797 inappropriate duplicates, we correctly classified 1733 cases (i.e., we predicted a confidence within the range of 33%-100%). Of the 3271 appropriate duplicates, we correctly classified 2936 cases (i.e., confidence in 0%-33%).

Bulk processing

The differentiation between low, fair, and high-confidence duplicates can help efficiently bulk process large paper volumes. By knowing which papers contain duplicates with a fair or high chance of being inappropriate, users can specifically investigate these duplicates. Our API allows bulk processing of hundreds of papers, and during the next week, we will update our API* to include information on whether low, fair, or high-confidence findings were detected in the scanned documents. Even when scanning hundreds of papers, only a small manual effort is necessary to detect inappropriate duplicates at scale.

Protect Research Integrity with Confidence

Start using Imagetwin to detect image integrity issues and support trustworthy research publishing.

Frequently asked questions

Imagetwin is software designed to detect integrity issues in figures of scientific articles. It helps identify inappropriate manipulations and duplications in various figure types, including western blots, microscopy images, and light photography.

Imagetwin is beneficial for researchers, peer reviewers, journal editors, and institutions aiming to uphold the quality and trustworthiness of scientific publications by ensuring the integrity of visual data.

Users can upload a PDF or multiple image files to Imagetwin. The software then scans the content using algorithms and vast databases of published scientific figures to detect potential integrity issues. Within seconds, results are presented through a web interface, highlighting any detected problems for review.

Yes, we prioritize data privacy and security, ensuring that all image indexing and exchanges are protected with industry-standard encryption and security best practices.

Create an account and start using Imagetwin immediately. We prepared a few example documents that you can scan free of charge.

Yes, Imagetwin is a powerful addition to the peer-review process. It automatically detects various integrity issues, which can then be quickly verified by a reviewer, enhancing the efficiency and accuracy of the review process. Imagetwin also partners with industry leaders in publishing and scholarly workflows, such as Morressier, TNQ Technologies and more, transforming how research is submitted, reviewed and published.

For more detailed guidance on using Imagetwin, contact our support team through our Contact Us page.