relatively straightforward with the use of data ware-
houses (Golfarelli and Rizzi, 2009; Kimball, 2011;
Kimball and Ross, 2013). In the bicycle sharing, such
data can be from the meteorological data sets explain-
ing weather conditions near each station, or whether
events appear near the stations. The essential function
of the data warehouse is to integrate data and trans-
form them into the confirmed structure, relevant for
the domain of interest. In this case, the domain of the
urban sustainable mobility to support reporting and
interpreting for the diverse urban stakeholders groups.
In Section 3.2, linguistic summaries are applied
on the bicycle sharing data to reveal patterns. Bene-
ficially, the Key Performance Indicator (KPI) for the
stakeholders should be expressed linguistically (Vais-
man and Zim
´
anyi, 2022). The reporting with linguis-
tic summaries gives a reliable, trustable, and easy-to-
understand overview of the actual state regarding the
stations (see Section 3.2). This interpretations provide
the basis for supporting effective decision-making for
the city (see Figure 2). In our case, KPI is expressed
by linguistic terms: few, about half and most of.
In order to generate the linguistic insights as afore
computed, the relevant ”data ingestion” framework,
or the way how the data are fed to the data ware-
house, should be set in place. Relevant levels of secu-
rity should be treated in line with existing regulation
(foremost GDPR) for data management.
The benefit of implementing an approach as pro-
posed in this work, is that a local authority can, if of
interest, extend the reporting of the collected data by
the other internal and external data and adjust report-
ing to diverse user categories. For instance, citizens
and users with disabilities benefit of LS, which pro-
vide an easy way to communicate mined patterns. For
advanced users, like traffic experts, external data cov-
ering, e.g., traffic densities of cars, weather conditions
and organized events can reveal how these parame-
ters influence bicycle sharing in the affected stations.
With an approach as is under the development in this
work, local authorities would have a better overview
of cycling in the city. Next, suited LSs can be adopted
for informing cyclists about their riding behaviour in
comparison to the average values, for instance. How-
ever, a summary of structure your length of ride is
around average is not informative enough. This sum-
mary is the same for person who use bicycle every
working day (but not during weekends) and for some-
one who rarely cycle on working days, but is a heavy
bicycle user during weekends. The better option are
quantified linguistic summaries like few days your
length of rides is slightly under average, and about
half of days your length of rides is about average.
5 DISCUSSION
The proposed approach has a significant applicability
potential. This holds especially when informing di-
verse urban stakeholders groups (including disabled
citizens) should be realized by a robust and compact
approach.
In our data intense society, we face the problems
of explaining mined patterns (Smits et al., 2018). A
promising way is by linguistic summaries. In order to
contribute, we raised two research questions. The first
research question is How can relevant enterprise ar-
chitectures be set to collect and transform urban data
coming from different sources? The answer is that the
processing data by the linguistics summaries requires
a clear data warehouse model for storing data coming
from diverse sources. Such data are relevant for ad-
vanced explaining summaries and revealed patterns.
The second research question is Could short quan-
tified sentences improve explainability of data? The
answer depends on the structure of the sentences. If
it is a basic structure of summary (1), then histograms
and the other charts are suitable when visual attention
is not disturbed (i.e., it should not be focused else-
where) (Arguelles and Trivi
˜
no., 2013), or summary
is not for visually impaired citizens. Regarding, the
summary with restriction, the situation is different. It
is more convenient to express it by sentences like in
Section 3.2 than on series of graphs covering more at-
tributes. Anyway, linguistically summarized sentence
is convenient for all urban stakeholders groups.
The next perspective is merging summaries with
maps. Either by using map features in summa-
rized sentences (Hudec et al., 2020), like whether a
particular proportion of objects influences the other
attributes, or interpreting summarized sentences on
maps. The positions of two stations used in exper-
iments (Section 3.2) are shown on map in Figure 3.
The very busy station is located near the river and
canal.
The main problem of mining all relevant summa-
rized sentences is the computational cost, when we
consider all stations and all possible summaries in a
large city. In the future work, we focus our work on
the optimisation in this direction.
6 CONCLUSIONS
A smart and sustainable mobility should rely on a
heavier use of bicycles. One possibility is by the bi-
cycle sharing concept. This work has shed light on
summarizing the bicycle sharing patterns of the bicy-
cle stations.
SEC-SCIS 2024 - Special Session on Soft Computing in Ethicity and Smart Cities Services
766