How to combine business approaches, advanced statistics, and technology? - Ekimetrics
Ekimetrics: big data specialists
Back to all insights

How to combine business approaches, advanced statistics, and technology?

How to combine business approaches, advanced statistics, and technology?

It is sometimes difficult to create a link between highly sophisticated statistical approaches and business reality. The transformation of data into value is an art that needs to combine three different pillars: business understanding, advanced statistics and technology. This triangular approach needs to be adopted by all machine learning projects.

Date : June 19th 2019

Category : Thought Leadership

Ekimetrics’ client, a leader in the insurance sector, wanted to improve management of its churned customers as part of a much broader corporate initiative: to adapt its positioning from a financial assets company into a service company, allowing its policy holders to better manage their personal risk .

A data-driven approach is therefore an integral part of their long-term corporate vision and one of the key factors for project success as identified by Ekimetrics. It involves a real paradigm shift for the insurer and its teams around the world.


#1 Customer needs and constraints

An integrated vision of the customer journey that reconnects key moments

Most companies only have a partial vision of their customers’ journey, due to either resource constraints or availability of data.

At Ekimetrics, we believe that building a holistic view of the client journey requires first and foremost a thorough understanding of the company’s business line and business sector, which allow for identification of the key interactions marking the various stages experienced by clients.

In the beginning of the engagement, a series of interviews and meetings with employees identified three main groups of client interactions during the life of an insurance policy.


Requests for risk coverage by the insured party.

Requests for risk coverage are typically followed by a insurer decision-making stage, e.g. decision to accept/deny the risk coverage request depending on the risk levels. This step requires various information exchanges between the insurer and its clients and in-depth evaluation of multiple criteria and outcomes (e.g. conditional acceptances, range of possible scenarios, debtor’s financial health, macro-economic factors, etc.), contributing to the complexity of the journey and the data to be manipulated.


Interactions carried out by the account manager.

These interactions involve day-to-day management, penalties, claims tracking and marketing events. As part of their current CRM strategy, a range of predefined actions impact the customer experience throughout the contract (e.g. three-month training, six-month sales visit, information requests). This provided an opportunity to collect CRM data which was not realized, as account managers had not been consistently entering data into the CRM tooling.


Management of claims.

Claims are delicate situations that require a case-by-case approach, with numerous possible scenarios/outcomes: reimbursement in instalments, total or partial reimbursement, negotiated reimbursement times, etc. This is a crucial aspect of the client relationship that has a significant impact on satisfaction, which will be outlined further below.


A sophisticated statistical approach involving dynamic scoring

The exploration of these three groups of interactions soon revealed that the client journey was not linear from one client to the next, generating further complexity. The key learning was that there is no isolated event with a drastic impact on churn, but that the customer experience could ultimately be affected significantly by a series of apparently inconsequential events.

A technological solution would have been complicated to construct, since each organisation had business line specific characteristics, various risk factors and differing contractual vs non-contractual business models (soft churn). Given the complications, starting at business line level to construct an ad-hoc solution was much more likely to succeed and therefore we adopted a tailor-made approach.


Score and reasons for the score: explain the customer’s versatility

The insurance sector is relatively advanced in keeping up with the latest statistical techniques, therefore it was apparently to most stakeholders that management of attrition should be based on the establishment of a dynamic score.

It is important to note that a key success factor was the ability to statistically capture changes in client satisfaction based on interactions over the life of the policy, which meant the following two aspects needed to be fully captured: chronological nature of events (the order of events is important) and dynamic scoring (the score must be continuous ). It was therefore logical to approach the project as the reconstruction of a story: the story of client behaviour, with all its unforeseen twists, whether positive or negative.

The goal of the project was to enabling the insurer to improve the client relationship at the times when it really counts and to reinvent pre-existing processes with departments where necessary.


Avoiding the black-box effect: guaranteeing legibility of the results for the Executive Committee and business lines

Many scoring methods are available, however some are more suitable than others in terms of business interpretation. For example, machine learning approaches are particularly difficult to translate; making them workable for business lines is often a challenge.

While most good data providers can deliver analytically powerful results, few manage to make these deliverables legible and functional in a business context and are unable to answer the question of: what do I need to change in my daily life or strategy to solve my problem?

This is where the concept of value creation comes into its own. The final step – translation of insights into business line value – remains a challenge for most data/advanced analytics providers in 2019.

From the outset, we made ensured that our approach was adopted by teams, both upstream and downstream – a vital prerequisite for garnering support for the project and creating tangible value.


#2 Solution adopted – Participants, criteria, process and timing

Technical: implementation of an initial prototype


IT environment, IS department: the cornerstone of a successful start

The prerequisite for recovering data is to connect to the information systems in order to access, collect and begin exploring the data. It is important to be flexible at this stage, since there are as many IT systems as there are companies!

It is necessary to have in-depth understanding of the environment of each Cloud provider (AWS, Azure, GCP), to juggle the various technologies and build an IT environment that will serve the business objective in the short and medium term. This step can take from a few weeks to a few months, depending on the nature of the databases available and their suitability for the intended objective.

To avoid bottomless pit syndrome, the approach favoured by Ekimetrics is to prioritise time-to-market for the initial results, which meant that stakeholders were convinced though concrete proof, and findings were based on an iterative and agile approach.


Collect the right data: type, quantity and structure

As with a car, no statistical model or algorithm works effectively without fuel. The data injected into the model should be chosen very carefully and be targeted based on a direct link to a specific business line problem. The variety, completeness, quality and quantity of the data impact the selection and relevance of a specific algorithm.

For the insurer, client data was naturally at the centre of the equation. We explored the accessibility, relevance and strength of the transactional data (purchasing behaviour), activity data (behaviour at each stage of the conversion tunnel such as site navigation, etc.), and customer service relationships data (complaints, etc.).

Since some data could not be used, we settled on decision-making data, commercial data (account manager interactions) and data concerning claims/complaints handling, which were reliable enough to identify a complete and realistic client journey. If none of this data was available, then the project may have been unworkable before it has even started. A data scientist’s creativity in identifying the right data or finding ways to obtain it is a key skill.

Quantity also matters. To ensure the reliability of our statistical models, it was necessary to check the volume of data available. In the case of this insurer, we harvested several hundred thousand data items, all necessary prerequisites for training the model, to enable it to draw credible and reliable conclusions.

Finally, the structure of the accessible data also influences the choice of algorithm. Deep learning, for example, requires a wealth of data on several dimensions, such as time, clients, events, etc. The unavailability of certain data immediately eliminates the possibility of choosing certain machine learning approaches, although that was not the case for our client.


Defining the best statistical approaches: machine learning and descriptive statistics

Machine learning: a neural network to tackle complexity

We have already discussed the complexity of the client journey, superimposing several layers of interactions by different departments and the creation of numerous possible scenarios depending on the type of events (positive decision, refusal, policy renewal subject to conditions, etc.).

We therefore chose to take a neural network approach (via a variant called “RNN” – Recurrent Neural Network), a type of model able to capture temporality and to capture the chronology of key moments, as well as to determine the positive or negative impact of events on customer satisfaction.

We deployed a dynamic approach to scoring, capable of evolving over time. This deep learning approach was the most appropriate to carry out this multi-dimensional analysis; traditional scoring would have required a massive amount of manual effort to create variables and would not have been fundamentally capable of building a client story. A deep learning approach, on the other hand, is powerful enough to capture the chronology and positive and negative influences on behaviour.

Descriptive statistics: bringing clarity to the black box

As with any neural network approach, however, there is an issue of interpretation as it is difficult to distribute contribution to the individual events . To maintain a certain clarity and encourage a consistently business-oriented message, we incorporated more descriptive statistics and very transparent models (linear regression, decision trees, etc.). This hybrid usage of statistical methods, support from the Executive Committee, adoption by operational teams, and quality controls (consistency checks, etc.) enabled us to achieve a subtle balance between precision of results and tangible business value and to propose realistic action plans.


Implementation: ensuring commitment and adoption by operational staff

As discussed at the start of this case study, it was imperative for the insurer that this project would not be simply another POC to be filed away.

In the context of the transition in the insurer’s positioning from an “assets” company to a “client-oriented service” company, it was crucial that the results of this analysis be implemented and used by local teams as part of their effort to improve client experiences across the company.

We therefore developed tactical but highly practical implementation steps to bridge between the strategic vision through data science and the day-to-day lives of the operational teams. Here are some examples:

  • Before undertaking full-scale implementation (> 4,000 employees), we rolled out the results to a limited number of business lines in targeted countries, in order to carry out testing and adjustments if necessary before global roll-out.
  • Updated every 15 days, the dynamic curve graph allowed departments to officially recognise the risk of churn and so to implement practical actions, at the right time, to improve the client experience – another good way of intertwining data driven thinking into operational activities.
  • Updated every two weeks, this dynamic curve enabled the departments to officially observe the churn risk and put concrete actions in place to improve the customer experience at the right time. It was also a good way to help the operational teams adjust to a data-driven way of thinking daily.
  • With our sample of champions, across several countries and business lines, we also identified the most useful information for the operational teams to help them advance visually along the curve. This provided a simple way of giving access to new information and preparing for the more widespread roll-out of this data during the final implementation stage (accessible via the information system, the ERP, etc.).
  • Finally, we introduced specific scripts required by the business lines. For example, in order to receive proactive alerts of spikes in risk we created risk thresholds and automated an alert system. These highly tailored and high value-added actions become part of everyday life and brought the brand promise to life on all levels.

An example is constructed below:

  • events impacting client satisfaction have been identified and positioned chronologically, to enable a reading in the form of a “story”;
  • risk thresholds were determined by Ekimetrics in agreement with the client;
  • it is easy to read: each department can visualise the sequence of events, gradually leading to an increasing risk of attrition;
  • this data visualisation proposed by Ekimetrics made the analyses accessible to the business line. This encouraged understanding of the project and therefore adoption by the teams.

Data Science provided the common thread between the development of the relational promise , the choice of use cases in support of this new brand positioning and eventual implementation. In addition to statistical models, we believe in the importance of implementing new, sustainable data and analytical capacities; leading to an ecosystem of data solutions, new analytical frameworks and new business line processes to promote a better understanding of client expectations.

Article co-written with the EBG team as part of The Digital Benchmark, and delivered in Berlin in May 2019 to more than 800 CMOs, Marketing Directors, Data Directors and Customer Experience Directors.


Read more about The Digital Benchmark

> Subscribe to the data science & AI newsletter!

All our industries

Latest news

Contact us!

  • We're committed to your privacy. Ekimetrics uses the information you provide to us to contact you. For more information about how we handle your personal data and your rights, check out our Privacy policy.
  • This field is for validation purposes and should be left unchanged.