The first action to modernize an Information System: Eliminate unnecessary pipelines?



A data system could be compared to a vast road network, composed of various roads, each created to meet a specific need at a given time.


As this network expands and ages, some paths (data pipelines) become underutilized or unused (replicated, obsolete).

The financial and organizational impacts are numerous:

  • 60% of data in the Cloud is not being used, according to NTT (source: IT Social).
  • According to Civo, for almost half of companies with more than 500 employees, the annual cost of the Cloud exceeds one million dollars, with growth rates that are difficult to sustain (source: Channel News).

 

The origin of these unnecessary data pipelines, these "phantom roads"?

Over time, Information Systems aggregate pipelines that have become obsolete:

  • Pipelines created for projects that have since been abandoned.
  • Duplicate pipelines, created due to a lack of coordination between teams. The advent of "data mesh" architectures appears to be a major accelerator of this situation.
  • Obsolete pipelines kept as a precaution ("you never know!") or to cover some hypothetical risk.


These "phantom roads" consume significant Cloud resources (storage, processing, bandwidth) that could be put to better use!

We have made progress on a software solution that addresses this natural drift, one as old as physics itself: entropy, the "degree of disorder reflecting the natural tendency of things to evolve towards a state of chaos".

 

This drift is not inevitable. It is, however, a race against time: systems have such a propensity for entropy that only industrialized mechanisms can keep pace with it.

 

This response is one of the features of {openAudit}, which relies on two mechanisms:

 

Continuously and technically identify pipelines to be decommissioned

It is possible to accurately map these complex entanglements and identify unused pipelines. 

This process requires two coordinated technical actions, which we offer with our {openAudit} software:

Analysis of data usage, to identify "informational dead ends"

 

  • {openAudit} analyzes the main technical stack to identify all the data consumed at the input and output of batch chains.
  • Data consumed by satellite (non-parsed) applications is also analyzed, to capture the full scope of useful information.
  • This dual analysis can be subtle and is configured to take the target business into account: regulatory information, for example, may be consumed only periodically while still having significant added value.

 

Through a "mirror analysis", informational dead ends are identified factually and continuously.
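A mirror analysis of this kind can be sketched as a set difference between what the batch chains produce and what is actually consumed downstream. The table names, the whitelist, and the data structures below are purely illustrative assumptions, not openAudit's internals:

```python
# Hypothetical sketch of a "mirror analysis": cross-reference the data
# produced by batch chains with the data actually consumed downstream.
# All table names here are illustrative assumptions.

produced = {              # outputs of the parsed batch chains
    "sales_raw", "sales_agg", "customer_dim", "legacy_kpi",
}
consumed = {              # inputs observed across batch chains + satellite apps
    "sales_raw", "sales_agg", "customer_dim",
}
# Data that is legitimately consumed only periodically (e.g. regulatory
# reports) must be whitelisted so it is not flagged as a dead end.
regulatory_whitelist = set()

dead_ends = produced - consumed - regulatory_whitelist
print(sorted(dead_ends))  # → ['legacy_kpi']
```

In practice, the consumed set would be refreshed continuously from the parsed stack and the satellite applications, so the list of dead ends stays current.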

Data Lineage: Tracing data streams to isolate unnecessary chains

 

  • Data lineage makes it possible to trace a pipeline back from unused data to the first table whose information is consumed in another branch.
  • From that branch onward, the unnecessary fraction of the chain can be removed without consequence.
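The backward walk described above can be sketched as a graph traversal: starting from an unused output, prune each upstream table until reaching one that also feeds a still-used branch. The lineage graph and table names below are hypothetical, not openAudit's actual implementation:

```python
# Illustrative sketch: walk a lineage graph backwards from an unused
# output, pruning tables until we hit one that feeds another branch.

lineage = {                      # child -> parents (hypothetical tables)
    "dead_report": ["agg_old"],
    "agg_old": ["staging"],
    "live_report": ["staging"],
    "staging": ["source"],
}

def consumers(table):
    """Tables that read directly from `table`."""
    return [child for child, parents in lineage.items() if table in parents]

def prunable(unused_output):
    """Chain fraction that can be decommissioned without side effects."""
    to_remove, frontier = [], [unused_output]
    while frontier:
        node = frontier.pop()
        to_remove.append(node)
        for parent in lineage.get(node, []):
            # Keep any table that still feeds a branch outside the
            # chain being removed; only fully orphaned parents are pruned.
            others = [c for c in consumers(parent) if c not in to_remove]
            if not others:
                frontier.append(parent)
    return to_remove

print(prunable("dead_report"))   # → ['dead_report', 'agg_old']
```

Here "staging" is kept because it also feeds "live_report"; only "dead_report" and "agg_old" are flagged for decommissioning.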

Clean the Information System

 

{openAudit} runs continuously, which allows the decommissioning of all unnecessary flows to be organized over the long term with internal teams.

A further classification can be carried out by business line, tool, or other criteria, to prioritize the process.

 

 

Modeling a harmonious system

We are currently developing an algorithm, which we have named "Harmony", that will allow the automated modeling of a system so that it is as rational and efficient as possible, even when many proprietary technologies are in use (ETL, data visualization tools). More news to come!

And if you would like to discuss automated migration topics, we would be happy to do so as well.

 
