<?xml version='1.0' encoding='UTF-8'?>
<!DOCTYPE article PUBLIC "-//NLM//DTD JATS (Z39.96) Journal Publishing DTD v1.1d1 20130915//EN" "JATS-journalpublishing1.dtd">
<article xmlns:xlink="http://www.w3.org/1999/xlink" article-type="research-article">
  <front>
    <journal-meta id="journal-meta-24b1807c95f44045ac145506541893bf">
      <journal-id journal-id-type="nlm-ta">Lund University, Sweden</journal-id>
      <journal-id journal-id-type="publisher-id">Lund University, Sweden</journal-id>
      <journal-id journal-id-type="journal_submission_guidelines">https://www.tsr.international/#:~:text=Traffic%20Safety%20Research%20(TSR)%20is,Traffic%20safety%20(ICTCT)%20association.</journal-id>
      <journal-title-group>
        <journal-title>Traffic Safety Research</journal-title>
      </journal-title-group>
      <issn publication-format="print">2004-3082</issn>
    </journal-meta>
    <article-meta id="article-meta-90fbab46af3547c6afe60345da966e3f">
      <article-id pub-id-type="publisher-id">e000076</article-id>
      <article-id pub-id-type="doi">10.55329/guai2275</article-id>
      <article-categories>
        <subj-group>
          <subject>Research article</subject>
        </subj-group>
      </article-categories>
      <title-group>
        <article-title id="article-title-8b7777e5bd664df5bfca32a6b3533cc9">How smooth is your ride? Comparison of sensors and methods for surface quality assessment using IMUs</article-title>
      </title-group>
      <contrib-group>
        <contrib contrib-type="author" corresp="yes">
          <contrib-id contrib-id-type="orcid">0000-0002-7870-8127</contrib-id>
          <contrib-id contrib-id-type="role">Conceptualization, Formal analysis, Investigation, Methodology, Software, Writing—original draft.</contrib-id>
          <name id="n-33c75ec333b8">
            <surname>Beeking</surname>
            <given-names>Moritz</given-names>
          </name>
          <email>moritz.beeking@salzburgresearch.at</email>
          <bio>
            <graphic xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/8045a821-6966-4c0b-ab45-5db6136bed7f-umoritz_beeking.png" content-type="author-image"/>
            <p>received his master’s degree in computer science with a minor in physics and a specialization on cognitive systems and robotics in 2021 from the Karlsruhe Institute of Technology (KIT) in Karlsruhe, Germany. Currently he works as a data scientist in the Mobility and Transport analysis group at Salzburg Research in Salzburg, Austria. His research focuses on the processing of data collected by sensor-equipped bicycles, especially using neural network based perception methods for LiDAR data.</p>
          </bio>
          <xref id="x-f73aca7ead47" rid="a-5ec7da02b3c8" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0009-0007-9169-1538</contrib-id>
          <contrib-id contrib-id-type="role">Investigation, Methodology, Visualization, Writing—review &amp; editing.</contrib-id>
          <name id="n-a9b28dab0c14">
            <surname>Wies</surname>
            <given-names>Hannah</given-names>
          </name>
          <bio>
            <graphic xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/64ecfbc6-ed01-4cfc-808d-dc790a2cadf3-uhannah_wies.png" content-type="author-image"/>
            <p>received the B.Sc. degree in geography from the University of Freiburg, Germany, in 2018, and the M.Sc. degree in applied physical geography and mountain research from the University of Graz, Austria, in 2022. Since 2022, she has been working as a researcher in the Mobility and Transport analytics group at Salzburg Research, Austria. She focuses on automated and connected mobility as well as active mobility including the collection and analysis of cycling data.</p>
          </bio>
          <xref id="x-50c7d4d46fb2" rid="a-5ec7da02b3c8" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0000-0001-9407-9570</contrib-id>
          <contrib-id contrib-id-type="role">Formal analysis, Methodology, Visualization, Writing—review &amp; editing.</contrib-id>
          <name id="n-b4b0c4cc417d">
            <surname>Steinmaßl</surname>
            <given-names>Markus</given-names>
          </name>
          <bio>
            <graphic xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/afe1a022-842a-43f0-9357-5c55baaeba99-umarkus_steinmassl.png" content-type="author-image"/>
            <p>earned his B.Sc. degree in mathematics in 2018 and his M.Sc. degree in data science in 2020, both from the Paris Lodron Universität Salzburg, Austria. Since 2018 he works for the Mobility and Transport analytics department of Salzburg Research in Salzburg, Austria. His research interests lie in processing and analysing traffic participants’ movement data on different scales, including floating car data on a national scale to high-frequency trajectories acquired by stationary object trackers at intersections.</p>
          </bio>
          <xref id="x-47550cb6ea3e" rid="a-5ec7da02b3c8" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="author">
          <contrib-id contrib-id-type="orcid">0000-0003-4052-5867</contrib-id>
          <contrib-id contrib-id-type="role">Conceptualization, Funding acquisition, Supervision, Writing—review &amp; editing.</contrib-id>
          <name id="n-f6f8370b9779">
            <surname>Rehrl</surname>
            <given-names>Karl</given-names>
          </name>
          <bio>
            <graphic xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/28fc48e7-91a9-4946-a86a-ed005145fd10-ukarl_rehrl.png" content-type="author-image"/>
            <p>Karl Rehrl holds a diploma degree in computer science from the University of Linz, Austria and a doctoral degree in geo-information from the Technical University of Vienna, Austria. He is heading the Mobility and Transport Analytics (MTA) research group at Salzburg Research, an applied research institute specialized in the field of Motion Data Intelligence. His research interests are in analysing and interpreting motion data in the field of mobility &amp; transport, with an emphasis on Trajectory Data and Cooperative Services. Karl Rehrl has 20+ years of experience in initiating and heading applied research projects and pilot demonstrations and published 70+ scientific articles. He is an editorial board member of the Journal of Location Based Services and the International Journal on Geographic Information Science.</p>
          </bio>
          <xref id="x-04d7b596fca9" rid="a-5ec7da02b3c8" ref-type="aff">1</xref>
        </contrib>
        <contrib contrib-type="editor">
          <name id="n-a18f78076a25">
            <surname>Schwab</surname>
            <given-names>Arend</given-names>
          </name>
          <xref id="x-412282abddcf" rid="a-d2c271f455e4" ref-type="aff">2</xref>
        </contrib>
        <contrib contrib-type="reviewer">
          <name id="n-6f3abe7fd365">
            <surname>Rasch</surname>
            <given-names>Alexander</given-names>
          </name>
          <xref id="x-cba17f3f1ac8" rid="a-b853acad75a7" ref-type="aff">3</xref>
        </contrib>
        <aff id="a-5ec7da02b3c8">
          <institution>Salzburg Research Forschungsgesellschaft mbH</institution>
          <country>Austria</country>
        </aff>
        <aff id="a-d2c271f455e4">
          <institution>Delft University of Technology</institution>
          <country>the Netherlands</country>
        </aff>
        <aff id="a-b853acad75a7">
          <institution>Chalmers University of Technology</institution>
          <country country="SE">Sweden</country>
        </aff>
      </contrib-group>
      <pub-date date-type="pub">
        <day>17</day>
        <month>12</month>
        <year>2024</year>
      </pub-date>
      <volume>7</volume>
      <elocation-id>Guest editor</elocation-id>
      <history>
        <date date-type="received">
          <day>31</day>
          <month>1</month>
          <year>2024</year>
        </date>
        <date date-type="accepted">
          <day>11</day>
          <month>11</month>
          <year>2024</year>
        </date>
      </history>
      <permissions>
        <copyright-year>2024</copyright-year>
      </permissions>
      <abstract id="abstract-abstract-title-83930c3a9c5c424daf39f9d0cda53190">
        <title id="abstract-title-83930c3a9c5c424daf39f9d0cda53190">Abstract</title>
        <p id="t-9c83b870c3f4">As a major component of riding comfort, surface roughness has a significant impact on peoples' decision to ride bicycles. Riding comfort is most commonly derived from accelerations measured by inertial measurement units (IMUs). However, roughness metrics from different works are not directly comparable as no ‘benchmark data’ exists. This work aims at alleviating this problem by comparing several well-established methods from literature on the same data. Furthermore, to quantify the effect of different sensor systems, for each test run data from both a smartphone and an industrial grade IMU were collected. To compare the derived roughness measurements, the reliability and stability of each sensor-method combination is calculated using non-parametric statistics. The results indicate handlebar mounted smartphones to be sufficient for surface roughness assessment. Furthermore, the selected roughness calculation method has the biggest impact on resulting assessments, above the impacts of both sensor and analyzed segment length. Based on the results, recommendations for surface roughness assessment are provided in the conclusion.</p>
      </abstract>
      <kwd-group id="kwd-group-dd444b4057df47ae8932032905819ff3">
        <title>Keywords</title>
        <kwd>cycling comfort</kwd>
        <kwd>inertial measurement unit (IMU)</kwd>
        <kwd>infrastructure assessment</kwd>
        <kwd>sensor bike</kwd>
        <kwd>surface roughness</kwd>
      </kwd-group>
    </article-meta>
  </front>
  <body>
    <sec>
      <title id="title-b3d56987a7df478ea435b6e7b476bb72">Introduction</title>
      <p id="paragraph-cfb0f3ae8d7b4e5f810c0c1452d9aa4b">Faced with climate change, congestion and health issues, governments and public bodies throughout the world should try  (<xref rid="R253322532192747" ref-type="bibr">EC, 2023</xref>; <xref rid="R253322532192770" ref-type="bibr">Holger et al., 2015</xref>; <xref rid="R253322532192771" ref-type="bibr">UN, 2024</xref>) to promote cycling as a sustainable, space efficient, and healthy mode of transport. An important factor for people's decision to ride a bicycle is cycling comfort  (<xref rid="R253322532192763" ref-type="bibr">Ayachi et al., 2014</xref>; <xref rid="R253322532192746" ref-type="bibr">CROW, 2016</xref>). The surface roughness of bicycle infrastructure is, among others, a vital component of bicycle comfort  (<xref rid="R253322532192744" ref-type="bibr">Castañon &amp; Ribeiro, 2021</xref>; <xref rid="R253322532192755" ref-type="bibr">Hoelzel et al., 2012</xref>; <xref rid="R253322532192752" ref-type="bibr">Landis et al., 1997</xref>). Multiple works also mention roughness to influence safety (<xref rid="R253322532192755" ref-type="bibr">Hoelzel et al., 2012</xref>; <xref rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>; <xref rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>) or actively investigate the correlation (<xref rid="R253322532192774" ref-type="bibr">Astarita et al., 2014</xref>; <xref rid="R253322532192760" ref-type="bibr">Gadsby et al., 2022</xref>). Furthermore, well-established methods to measure surface roughness exist. Most works on measuring the surface roughness of bicycle infrastructure use acceleration data collected with IMUs  (<xref rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>; <xref rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>; <xref rid="R253322532192764" ref-type="bibr">Litzenberger et al., 2018</xref>; <xref rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>; <xref rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>). The collected accelerations are then further processed on a segment-wise basis to derive roughness metrics. However, there are different approaches to every single step of this process. With the details explained in section 2, there are differences in the type of IMU used, mounting position, sampling frequency, segment length and used axes of the accelerometer.</p>
      <p id="paragraph-cc6d804bfc08431e93ae35f9bf71d3f8">The most important well-defined standard in the context of surface roughness measurement is ISO 2631-1 on ‘Mechanical vibration and shock - Evaluation of human exposure to whole-body vibration’. It defines acceptable vibrational acceleration levels measured an weighted at different frequencies between 0.1 Hz to 80 Hz. Measurement locations, directions, and further process characteristics are also lined out. (<xref id="xref-3d28f90a80a441819b95387135f6b94b" rid="R253322532192776" ref-type="bibr">ISO, 1997</xref>)</p>
      <p id="paragraph-24678497ade942c0ab861a5818af8f98">Furthermore, the International Roughness Index (IRI) is a well-established metric in road surface assessment. It is defined as the vertical displacement between the actual road surface and a virtual ideal plane. Its unit is either millimeters of displacement per traveled meter or meters of displacement per traveled kilometer (metric system, identical), or inches of displacement per mile traveled (imperial system, linearly proportional). The IRI is based on the so-called quarter-car model simplifying vehicles to a single wheel with both sprung and unsprung mass atop. (<xref id="x-dde572024fbf" rid="R253322532192761" ref-type="bibr">Sayers &amp; Karamihas, 1998</xref>)</p>
      <p id="paragraph-1981b8a903f745bdbe466b4a06b54c7e">Although based on ISO 2631-1 and the IRI some recurring aspects exist, every work introduces its own method of calculating roughness based on the measured accelerations. While the findings of some of these works have been cursorily compared  (<xref id="xref-9422800cbaa3417f88b48451696d12b7" rid="R253322532192772" ref-type="bibr">Gadsby &amp; Watkins, 2020</xref>), a proper comparison of methods and their characteristics has not been conducted yet. Similarly, according to a recent review on the related topic of bikeability indices  (<xref id="xref-412ec56254bb4cfcb45ce537e001ada5" rid="R253322532192744" ref-type="bibr">Castañon &amp; Ribeiro, 2021</xref>), several existing works point out that the lack of comparability and standardization is hampering the applicability of bikeability indices. The same effect can be assumed for surface roughness indices. Therefore, the aim of this work is to compare the results of applying selected roughness calculation methods to data collected with two different sensors during the same test rides. The overall process to do so is outlined in <xref id="x-84184a43c508" rid="f-e650207397d9" ref-type="fig">Figure 1</xref>.</p>
      <fig id="f-e650207397d9" orientation="portrait" fig-type="graphic" position="anchor">
        <label>Figure 1 </label>
        <caption id="c-9b07cd31ae81">
          <title id="t-b3a6b5facc3c">Comparison process outline</title>
        </caption>
        <graphic id="g-8394fa017525" xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/ebf82380-97d5-411b-9734-4ca3dd25c51b-ufigure_1.png"/>
      </fig>
      <p id="paragraph-c977a3360faa4f7eab47a82046cfa64c">As described in detail in section 3, two criteria were developed for comparison of the resulting assessments. First, the reliability, defined as the agreement of any given assessment with all other assessments for the same segment. And second, the stability, defined as the inverse of the dispersion of each sensor-method combination per segment. Based on these criteria, each sensor-method combination's assessments are evaluated. Finally, using these evaluations as well as practical considerations like sensor availability or ease of method implementation, recommendations for choosing the right sensor-method combination for a given task are provided.</p>
      <p id="paragraph-fe7b2fc0f495496292a6f728de9654fc">The rest of this work is structured as follows. First, the most relevant works on bicycle comfort and especially roughness assessment based on acceleration measurements are described. Special attention is given to the selection of the five works chosen for comparison. Subsequently, the methodology consisting of data collection, roughness calculation and comparison of results is described in detail. Next, the stability and reliability of the different sensor-method combinations are compared and interesting findings pointed out. Last, the work is concluded by emphasizing the main findings and their implications for bicycle infrastructure assessment, and finally pointing out possible future research directions.</p>
    </sec>
    <sec>
      <title id="title-2328ea2163a34559b7c22d7217d2593c">Related work</title>
      <p id="paragraph-f26b7870467f4b70b3453be46185cead">The research fields of cycling comfort estimation and especially surface roughness estimation based on acceleration measurements have both been well explored in the past. The following chapter will first outline some relevant reviews and works on bicycle comfort, emphasizing the role of surface roughness. Subsequently, typical sensor setups and measuring parameters for acceleration measurement are presented. Next, the basics of roughness calculation and some notable examples are presented. Finally, the selection process used to decide which methods to feature in this comparison, as well as some general features of the selected methods will be described. The in-depth explanation of the implemented methods can be found in section 3.</p>
      <sec>
        <title id="title-5590a267f73f47758ebba153b180433b">Cycling comfort and surface roughness</title>
        <p id="paragraph-9552d098dcaf4909bd5242729244597d">Numerous concurring definitions of cycling comfort can be found in literature  (<xref rid="R253322532192746" ref-type="bibr">CROW, 2016</xref>; <xref rid="R253322532192744" ref-type="bibr">Castañon &amp; Ribeiro, 2021</xref>; <xref rid="R253322532192752" ref-type="bibr">Landis et al., 1997</xref>; <xref rid="R253322532192756" ref-type="bibr">Yamanaka &amp; Namerikawa, 2007</xref>; <xref rid="R253322532192769" ref-type="bibr">Yamanaka et al., 2013</xref>). However, there are certain recurring aspects of comfort mentioned by the majority of works on the topic. Both cycling comfort in general and its sub-aspect surface roughness are often referred to in so called bikeability indices trying to assess how well-suited a given area is for cycling. The review of bikeability indices by <xref id="xref-9c3323c03a614db5a147c7e56c9cfc09" rid="R253322532192744" ref-type="bibr">Castañon and Ribeiro (2021)</xref> provides a good overview of recent works on the topic and, more relevant here, bikeability criteria considered by these works. The criteria analyzed most often are geometric design features of the cycling infrastructure, the accessibility of relevant areas, the traffic safety and the topography of the study area. Each of these aspects was considered by eight out of the 14 works they looked at. The geometric design features include the surface type and condition, used by 5 out of the 14 works as a bikeability indicator. The only other indicator also used by five of the included works was the presence of dedicated cycling infrastructure. Thus, the surface roughness, considered to result from the combination of surface type and condition, can be considered to be among the most relevant aspects of cycling comfort. (<xref id="xref-5a438fb4328f410bbc1269c715ae6130" rid="R253322532192744" ref-type="bibr">Castañon &amp; Ribeiro, 2021</xref>) </p>
        <fig id="figure-11c8886b977546cda4b39d7d38be1f52" orientation="portrait" fig-type="graphic" position="anchor">
          <label>Figure 2 </label>
          <caption id="caption-9783f337a98f4bf493347f18dfe7cced">
            <title id="title-6318f3c031c148438194ccdf72e2b378">Common mounting positions of accelerometers for surface roughness assessment.To the top right of either drawing, the corresponding axes are annotated. These are the assumed axes of the bicycle, the axes of the actual accelerometers should be expected to be at least slightly rotated.</title>
          </caption>
          <graphic id="graphic-0a9835561d2947cfae4d63cdc6b644e2" xlink:href="https://s3-us-west-2.amazonaws.com/typeset-prod-media-server/bfe17e6a-d489-4c14-86ba-f40207139430image2.png"/>
        </fig>
        <p id="p-f8d218e1cd44"><xref id="x-5d15ee293d40" rid="R253322532192769" ref-type="bibr">Yamanaka et al. (2013)</xref> and <xref id="x-1339743db541" rid="R253322532192756" ref-type="bibr">Yamanaka and Namerikawa (2007)</xref> developed and calculated a bicycle level-of-service in two separate works. In the first work  (<xref id="xref-c6c3e84ffafa4c1287e349f689128ba6" rid="R253322532192756" ref-type="bibr">Yamanaka &amp; Namerikawa, 2007</xref>), they listed the roughness of the road surface, the required effort by the cyclist, and braking, slow speed, speed deviation and comfort of speed as the main factors of a bicycle level-of-service. For the second work  (<xref id="xref-119574fc6a734a75acba23d2a1ad7a16" rid="R253322532192769" ref-type="bibr">Yamanaka et al., 2013</xref>), they changed this list to the stability of the speed, stops, vibrations, steering, traffic density, and braking behavior. Although in the later work vibrations were considered less important, they still confirm surface roughness to be a relevant aspect of cycling comfort.</p>
        <p id="paragraph-ce0efce0e3ef45c686ace5919fe06b50">Surface roughness being of major importance to cycling comfort is not a new concept either. In the late eighties, Axhausen et al. already found it to be of importance especially for experienced cyclists in a preference study  (<xref id="xref-6369d4a4b04e47cb9ddd22e23117ee72" rid="R253322532192748" ref-type="bibr">Axhausen &amp; Smith, 1986</xref>). This was later experimentally proven by Landis et al., finding a strong correlation between pavement surface condition and participants comfort perception  (<xref id="xref-d852aa8da10a458792a4e09ab6eed080" rid="R253322532192752" ref-type="bibr">Landis et al., 1997</xref>). Last, the English version of the Dutch CROW Design Manual for Bicycle infrastructure, often considered a kind of gold standard for cycling infrastructure, states ‘The surfacing on the road section should satisfy the requirements in terms of evenness.’ (<xref id="x-439adaa79e06" rid="R253322532192746" ref-type="bibr">CROW, 2016</xref>).</p>
      </sec>
      <sec>
        <title id="title-c924644ff7b9493fb7782bc70e755e1e">Acceleration measurements</title>
        <p id="paragraph-31a116cacb9849d3ae30aa51fb736a76">Measuring surface roughness of cycling infrastructure based on accelerations is a well established approach with recurring components found in literature. While several, especially earlier, works used dedicated accelerometers  (<xref rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>; <xref rid="R253322532192751" ref-type="bibr">Gao et al., 2018</xref>; <xref rid="R253322532192742" ref-type="bibr">Neto et al., 2018</xref>; <xref rid="R253322532192773" ref-type="bibr">Olieman et al., 2012</xref>), most recent works rely on smartphones for data collection  (<xref rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>; <xref rid="R253322532192764" ref-type="bibr">Litzenberger et al., 2018</xref>; <xref rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>; <xref rid="R253322532192749" ref-type="bibr">Wijerathne et al., 2018</xref>; <xref rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>). In most cases, authors use vertical acceleration values, either by installing the IMU in such a way that one axis is perpendicular to the ground plane  (<xref rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>; <xref rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>), or by calculating the resulting vertical acceleration from all three axes  (<xref rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>; <xref rid="R253322532192742" ref-type="bibr">Neto et al., 2018</xref>; <xref rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>). Some works use all three axes of the accelerometer directly  (<xref rid="R253322532192751" ref-type="bibr">Gao et al., 2018</xref>; <xref rid="R253322532192764" ref-type="bibr">Litzenberger et al., 2018</xref>). The accelerometers and smartphones in these studies are mounted to the fork  (<xref id="xref-291eee0bdf8541daa99106e07e25ae0c" rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>), the handlebar stem (<xref rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>; <xref rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>), the left or right half of the handlebar  (<xref id="xref-7216f19081894bd6ada20a16f45beff9" rid="R253322532192751" ref-type="bibr">Gao et al., 2018</xref>), or the front  (<xref rid="R253322532192764" ref-type="bibr">Litzenberger et al., 2018</xref>; <xref rid="R253322532192742" ref-type="bibr">Neto et al., 2018</xref>) or back  (<xref id="xref-a5bdc8e86b744b3da01692a1c3847109" rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>) of the top tube. Typical sampling frequencies for the acceleration data range between 20 Hz  (<xref id="xref-f288807466294c948de77c6edad2ca86" rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>) and 100 Hz  (<xref rid="R253322532192764" ref-type="bibr">Litzenberger et al., 2018</xref>; <xref rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>), with 50 Hz being most common  (<xref rid="R253322532192757" ref-type="bibr">Harikrishnan &amp; Gopi, 2017</xref>; <xref rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>; <xref rid="R253322532192742" ref-type="bibr">Neto et al., 2018</xref>; <xref rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>; <xref rid="R253322532192749" ref-type="bibr">Wijerathne et al., 2018</xref>) and occasional higher values up to 512 Hz  (<xref id="xref-088d7854da4a46f7be24756c49cd2843" rid="R253322532192758" ref-type="bibr">Calvey et al., 2015</xref>).</p>
      </sec>
      <sec>
        <title id="title-083ecef2ca1d4f87877da1fbf38180d6">Roughness calculation</title>
        <p id="paragraph-3c6bc76fb01248f6ab389b5375b96506">With regard to the roughness calculation, ISO 2631-1  (<xref id="xref-efbfb7c6192a4d64bb46fbf4397b112e" rid="R253322532192776" ref-type="bibr">ISO, 1997</xref>) on whole body vibrations is often mentioned as a basis for roughness measures and indices  (<xref rid="R253322532192751" ref-type="bibr">Gao et al., 2018</xref>; <xref rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>). Furthermore, the IRI  (<xref id="x-52dba8a2ee37" rid="R253322532192761" ref-type="bibr">Sayers &amp; Karamihas, 1998</xref>)  is a well-established roughness measure. Therefore, some works try to directly calculate it  (<xref id="xref-de3842a888aa45e89ea6084b2be43e49" rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>), while some consider closely related measures  (<xref rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>; <xref rid="R253322532192751" ref-type="bibr">Gao et al., 2018</xref>). Other works introduce their own comfort measures not related to ISO 2631-1 or the IRI  (<xref rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>; <xref rid="R253322532192749" ref-type="bibr">Wijerathne et al., 2018</xref>).</p>
        <p id="paragraph-f96c97c4a3674531886fa77681c842c4">Roughness calculation is typically performed on a segment-wise basis. There is no standard segment length, with 5 m  (<xref id="xref-38948282571247fb8adba60755be6936" rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>), 10 m  (<xref id="xref-f943383da6a44548be2297277364082b" rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>), 20 m  (<xref id="xref-d8c6a7b73dd24e218af17ed78601b258" rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>), and 100 m  (<xref id="xref-911d18830b82430a8bec565fa080db4c" rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>) all found in literature. In some works, test rides are not segmented further and instead roughness is calculated per test route. <xref id="xref-500156c4a3664a5c90f60ccd79f56c23" rid="R253322532192751" ref-type="bibr">Gao et al. (2018)</xref> analyzed test routes with an average length of 250 m, while  (<xref id="xref-d20706f67fba42aeb463828e9b20169d" rid="R253322532192764" ref-type="bibr">Litzenberger et al., 2018</xref>) choose test routes with approximately 100 m each. <xref id="xref-0ac7b256a786476b813f4ddc392c6aa7" rid="R253322532192742" ref-type="bibr">Neto et al. (2018)</xref> did not group the measured accelerations by travel distance but by time, calculating roughness for every 2 seconds of collected data.</p>
        <p id="paragraph-7c10dc27350a4ada86d549938010584c">In recent years, sensor equipped bicycles have become a lot more prevalent than before, also yielding more works on surface roughness assessment. A very good overview of the use of these sensor bicycles in research was compiled by (<xref id="xref-44dfbd3890014d3e80173c641e7001b0" rid="R253322532192772" ref-type="bibr">Gadsby &amp; Watkins, 2020</xref>). They list six works investigating pavement condition. Two works by the same research group  (<xref rid="R253322532192758" ref-type="bibr">Calvey et al., 2015</xref>; <xref rid="R253322532192767" ref-type="bibr">Taylor &amp; Fairfield, 2010</xref>) describe a sensor bike suited for acceleration measurement but no actual calculation of surface roughness. One work used a neural network to classify different pavements  (<xref id="xref-cc35a19c790741439d7b4c4b20173443" rid="R253322532192742" ref-type="bibr">Neto et al., 2018</xref>), however their description was considered unsuited for reimplementation. Two works  (<xref rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>; <xref rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>) were chosen for comparison and are described in the next section.</p>
        <p id="paragraph-38d12695901d448bab405e67c7ae99af"><xref id="xref-13319f3ffe434f948be8cdeffd602393" rid="R253322532192749" ref-type="bibr">Wijerathne et al. (2018)</xref> describe a sophisticated approach to calculate surface roughness independent of accelerometer position and orientation. To this end, they used four smartphones, one mounted on the handlebar, one in a front bag, one attached to the test riders arm, and one carried in a backpack. They used a wavelet-transform followed by a Taylor expansion and tested for Lipschitz continuity to detect bumps. Although impressive, their work was not chosen for comparison in this work. The reason is their focus on detecting single bumps, making comparison with methods yielding a segment-wise roughness metric difficult (<xref id="xref-c762e228388e4e31887dfbb243d8ff7c" rid="R253322532192749" ref-type="bibr">Wijerathne et al., 2018</xref>).</p>
      </sec>
      <sec>
        <title id="title-0c30eebee5e449f991576c15e3b1e190">Used methods</title>
        <p id="paragraph-89648870549448c2adeff07f08ebde2b">From the numerous works using IMUs on bicycles for surface roughness calculation, a selection was made based on three considerations:</p>
        <list list-type="order">
          <list-item id="li-7fb6c3e7c663">
            <p><bold id="s-e5ea1d72072a">Popularity</bold>: well-cited works were chosen over less known ones in an attempt to compare the most used and therefore most relevant approaches.</p>
          </list-item>
          <list-item id="li-17ed3aa0de85">
            <p><bold id="s-09dce08319ee">Quality of description</bold>: for comparison, a reimplementation of the selected methods was necessary. Therefore, methods were excluded if their original description was not sufficiently detailed for this.</p>
          </list-item>
          <list-item id="li-ea3fe528a122">
            <p><bold id="s-5524d0ee4510">Uniqueness</bold>: methods significantly different from others were chosen over slight alterations of more popular works.</p>
          </list-item>
        </list>
        <p id="paragraph-c0ffb786c7b64c2bbd80898a79638f6a">Ultimately, five works were chosen for comparison. The works by <xref id="xref-1e5defcfcf034d6bb3acbf27c42ea6b5" rid="R253322532192768" ref-type="bibr">Bíl et al. (2015)</xref> and <xref id="xref-f23fa333106042fdbaee3078b53876ba" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref> are the most cited works on roughness calculation from accelerations measured using a sensor-equipped bicycle. <xref id="xref-8f7619710bbe4ae1a8c68e6998a5b81a" rid="R253322532192768" ref-type="bibr">Bíl et al. (2015)</xref> describe their algorithm for roughness calculation commendably concise. <xref id="xref-6ed2fac252ae472dbe001e02f0d29371" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref> on the other hand are among the few authors using more than one accelerometer axis for their calculations. However, as a first step they derive the vertical acceleration for further processing. <xref id="xref-760465b2995546dfbec5bef8796be890" rid="R253322532192751" ref-type="bibr">Gao et al. (2018)</xref> are among the even fewer authors actually using all axes and their work is still fairly popular. All of these methods produce numerical metrics. However, for further use in categorization or maintenance of bicycle infrastructure, ordinal metrics are often preferable. Therefore, two works presenting methods yielding such metrics were also chosen for comparison: The work by <xref id="xref-641edc70836f46de9fa8e6b84f6aab76" rid="R253322532192765" ref-type="bibr">Nuñez et al. (2020)</xref> combines video analysis and acceleration measurements for assessing the quality of cycling infrastructure. To this end, they also present an adaption of the ISO 2631-1 (<xref id="xref-a8ea19e06fbe42b4b1dbd36954634110" rid="R253322532192776" ref-type="bibr">ISO, 1997</xref>), yielding six comfort classes for surface roughness. Last, the method presented by <xref id="xref-1a64dc581ef14fcca2fdfbb4207536f3" rid="R253322532192775" ref-type="bibr">Kranzinger and Leitinger (2021)</xref> has been chosen for comparison. It also yields an ordinal metric for surface roughness and, more interestingly, is not based on the acceleration itself but its first derivative, the so called jerks. The methods for roughness calculation from accelerations used in these works are described in detail in section 3, together with descriptions of the corresponding reimplementations. The remainder of this chapter outlines the general approach of each of these works, especially aspects not found in other works.</p>
        <p id="p-05ec277cbf99"><xref id="xref-6a981d5d91f741f485bf09959d8ac4ef" rid="R253322532192768" ref-type="bibr">Bíl et al. (2015)</xref> measure accelerations using a relatively low frequency of 20 Hz, compared to the 50Hz found in most works. The accelerometer is attached to the fork of the bicycle in such a way that one axis is considered vertical and used for roughness calculation. The separate GNSS sensor is mounted to the handlebar, both signals are combined after collection based on their timestamps. For roughness calculation, only downwards accelerations  &gt;1 g are used. The resulting so-called dynamic comfort index (DCI) ranges between zero and one. Higher values correspond to more comfortable roads.  (<xref id="xref-cfe2450e63d64d0ab193ea2e7786ffe4" rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>)</p>
        <p id="paragraph-69963f4e4b344c6e9d96392b2a1148ee">The work by <xref id="xref-74210635b83b4f308c93ecb48b040294" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref> is based on the idea of the quarter-car model used for calculation of the IRI  (<xref id="xref-0002e44e9cd94613bac96559eec5e370" rid="R253322532192761" ref-type="bibr">Sayers &amp; Karamihas, 1998</xref>). They measure accelerations at 100 Hz and GNSS localizations at 1 Hz using a smartphone mounted to the handlebar stem. Vertical acceleration is derived by projection of measured accelerations with a reference unit vector calculated from stand still points at the beginning of each test run. The IRI is meant to represent vertical displacement. In order to calculate it, double integration of the vertical accelerations is used. The resulting IRI is measured in mm/m vertical displacement and ranges from zero to approximately 12 mm/m on their test tracks.  (<xref id="xref-5fa2118eae4841c2be622358b235eda6" rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>)</p>
        <p id="paragraph-7a71f47eb05a4b9d8928f7d60094ae69"><xref id="xref-eb32aa9f9d0649878a386971444261e6" rid="R253322532192751" ref-type="bibr">Gao et al. (2018)</xref> used an accelerometer mounted on the left half of the handlebar and a separate GNSS sensor for data collection. They report neither acceleration nor GNSS recording frequency. Their test subject was instructed to keep a constant speed between 12 km/h to 16 km/h. Following ISO 2631-1  (<xref id="xref-9bff0bad91d14eb38b18ff6b543b3bd9" rid="R253322532192776" ref-type="bibr">ISO, 1997</xref>), they weighted the measured accelerations in the frequency domain using a small band filter. Unfortunately, they do not report on the parameters of the weighting or the filter. Notably, they use all three axes of the accelerometer for roughness calculation without extracting the vertical component first. Their so called Dynamic Cycling Comfort (DCC) ranges between zero and just above three for their test routes.  (<xref id="xref-a12cd44ce44341e8b83ef058faae3d8b" rid="R253322532192751" ref-type="bibr">Gao et al., 2018</xref>)</p>
        <p id="paragraph-f5f3d79f54a64822ae1539bcace32409"><xref id="xref-c4329501cd4a4c67b86200677791cee8" rid="R253322532192765" ref-type="bibr">Nuñez et al. (2020)</xref> use two sensors to calculate their so called Bicycle Environment Quality Index (BEQI), a camera mounted at the front of the frame for capturing videos and a smartphone mounted on the top tube in front of the saddle to capture accelerations. The smartphone is mounted parallel to the ground such that a single accelerometer axis is considered the vertical acceleration. Accelerations are sampled at 50 Hz. Notably, they used frequency analysis to find characteristic frequencies of different pavement types. Based on ISO 2631-1 (<xref id="xref-9f53ccb2f84d4d44b275a40fc4fb4cf6" rid="R253322532192776" ref-type="bibr">ISO, 1997</xref>), roughness is rated as one of six classes depending on the root mean square of vertical accelerations.  (<xref id="xref-8b531b0b265944c9a86d9246d33d01ff" rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>) </p>
        <p id="paragraph-be6bcda703a3485ab36b5073e5a37cf9"><xref id="xref-3b7988169ea44d70972048e71bf759be" rid="R253322532192775" ref-type="bibr">Kranzinger and Leitinger (2021)</xref> are the only authors using the derivative of vertical accelerations, called jerks, to calculate surface roughness. They assessed the surface roughness of impressive 436.6 km of cycling infrastructure, analyzing 5 166 km of test runs. Accelerations are captured using a smartphone mounted to the stem of the handlebar. Accelerations are sampled at 50 Hz and GNSS localizations at 1 Hz. Subsequently, four roughness classes are calculated per 10 m segment using the k-means++ clustering algorithm (<xref id="xref-9300ae92f39747cdbaa8d834fe1ce60a" rid="R253322532192766" ref-type="bibr">Arthur &amp; Vassilvitskii, 2007</xref>). As input parameters, the average jerk value and the share of jerks above a given threshold of 1 200 m/s<sup id="superscript-948fece77ce64e5ea7922c463a8254a0">3</sup> are used. The roughness metric of each section is then defined as the class it is assigned to by the clustering.  (<xref id="xref-1ca50f2ec752408888d7c1698163201e" rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>) </p>
      </sec>
    </sec>
    <sec>
      <title id="title-4f8108f4f02743ca9d2e45cb906a03b7">Methodology</title>
      <p id="paragraph-607e17146bbd4892843d3d3ab1169408">The presented work consists of three main steps described in detail in the remainder of this chapter: data collection, roughness calculation and comparison of results. For data collection, methods from different existing works are combined in an attempt to facilitate broad applicability of our results. Roughness calculation is carried out using careful reimplementations of preexisting algorithms. The methods for result comparison were specifically developed for this work based on well-established statistical metrics. The general flow of data, as displayed in <xref id="x-cf206abb9e61" rid="f-e650207397d9" ref-type="fig">Figure 1</xref>, is as follows:</p>
      <list list-type="order">
        <list-item id="li-2463a57e1211">
          <p>Data is collected by a smartphone and a dedicated IMU/GNSS simultaneously.</p>
        </list-item>
        <list-item id="li-a52098d23817">
          <p>The data is fed into multiple methods from literature calculating metrics intended to represent the surface roughness.</p>
        </list-item>
        <list-item id="li-4785d21d55fc">
          <p>The resulting assessments are sent to evaluation algorithms calculating stability and reliability metrics for each sensor-method-segment-length combination.</p>
        </list-item>
      </list>
      <sec>
        <title id="title-323d870c6130475096179fd3042312ad">Data collection</title>
        <p id="paragraph-432396fdb00743ec9ec63c642d15a367">The data used in this work was collected between September and November 2023 in four cities and villages in Austria. The test routes are depicted in <xref id="x-cd17b33c6bd8" rid="f-0a50a92b2e3b" ref-type="fig">Figure 3</xref>. Each route was driven 8–16 times. A smartphone and a dedicated IMU-GNSS-device were mounted side by side for simultaneous data collection. For each route, data was collected in one continuous session, without remounting of any sensors, as to keep their position and orientation steady. The collected data was matched to the so called ‘Graphenintegrationsplattform’ (GIP), a national Austrian digital road graph  (<xref rid="R253322532192742" ref-type="bibr">Neto et al., 2018</xref>; <xref rid="R253322532192759" ref-type="bibr">ÖVDAT, 2022</xref>). The generated trajectories were split into segments of differing length for segment-wise roughness calculation. Data from both sensors was brought to a common data format and collected in a single data set for further processing.</p>
        <fig id="f-0a50a92b2e3b" orientation="portrait" fig-type="graphic" position="anchor">
          <label>Figure 3 </label>
          <caption id="c-8f5500e1b147">
            <title id="t-5169bd055b06">Test routes investigated in this work. Note the different scales per map, which are a consequence of the very different lenghts of test routes.</title>
          </caption>
          <graphic id="g-e8af03d33b46" xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/cc0de139-1bdb-49b6-a0bc-a3d11e6156a8-ufigure_3.png"/>
        </fig>
        <sec>
          <title id="title-2c760eb7f0cf4efb8d4f865ebe82e0b9">Sensor setup</title>
          <p id="paragraph-f8182dde66f2498289d3e680623e6494">The sensor setup used for this work is depicted in <xref id="x-9dcae66ff965" rid="figure-444c648d747f4234b182d2fb80f2094d" ref-type="fig">Figure 4</xref>. It consists of two different sensors in different mounting positions. As discussed in section 2, most recent works rely on smartphones for acceleration measurement. The reasons are their ubiquitous availability and ease of combined acceleration-localization data collection. However, to the best of the authors' knowledge, no comparison whether a dedicated, industrial grade IMU would yield more reliable or stable assessments has been undertaken yet.</p>
          <fig id="figure-444c648d747f4234b182d2fb80f2094d" orientation="portrait" fig-type="graphic" position="anchor">
            <label>Figure 4 </label>
            <caption id="caption-386001ff64854eda989955752e4c981d">
              <title id="title-559abd51dfca4773a372c9600e1b8bea">Sensor setup for acceleration data collection. The Xiaomi Mi 9X smartphone can be seen attached to the right half of the handlebar. The round dark grey device on the handlebar stem is the XSens MTI-680G, containing an industrial grade IMU and GNSS sensor. As can be seen, the XSens is slightly tilted upwards, yielding an IMU z-axis slightly pointing backwards towards the rider. The Smartphone on the other hand is tilted back and to the left, resulting in an IMU z-axis pointing approximately towards the riders left shoulder.</title>
            </caption>
            <graphic id="graphic-e246ead05711447ca56cda404376d6fe" xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/02d8ebd7-370c-4239-8300-d53b5d59d458-ufigure_4.png"/>
          </fig>
          <p id="paragraph-ba80d647a93f4faa8426d81c563ab0a9">The two most popular mounting positions found in literature are the handlebar stem and the top tubes' front end. These are very close to each other, however, like the sides of the handlebar, the handlebar stem turns when steering while the top tube does not. The handlebar right and left of the stem is not often used in literature. However, it is also a commonly used mounting position for smartphones used for navigation and similar tasks. These considerations yielded the following sensor setup:</p>
          <list list-type="bullet">
            <list-item id="li-706e7c7cba71">
              <p>An XSens MTI-680G1<xref id="x-9bc7e94c39fb" rid="f-f2fdcae63d3a" ref-type="fn">2</xref>  containing both an industrial-grade IMU and GNSS sensor mounted to the stem of the handlebar.</p>
            </list-item>
            <list-item id="li-50ef590fafbd">
              <p>A Xiaomi Mi 9X smartphone mounted in the right half of the handlebar using a rigid steel mounting kit.</p>
            </list-item>
          </list>
        </sec>
        <sec>
          <title id="title-111be5cead2548c2a6a448f5222c080f">Map matching</title>
          <p id="paragraph-4cf620a0a6674dc48f4451f91583371c">To calculate roughness of specific road segments, the collected data is first mapped onto a graph representation of the underlying road network. The GIP, a national Austrian digital road graph (<xref id="xref-f7471f0769324866854feae7365f050b" rid="R253322532192759" ref-type="bibr">ÖVDAT, 2022</xref>) was chosen for this. The GIP divides roads into sections delimited by points were another road is met or a significant road characteristic, like the number of lanes, changes. Especially in inner city scenarios, this may yield many very short sections.</p>
          <p id="paragraph-6009594c851a4027ab27c8081940e5ce">Roughness calculation is usually done for fixed length segments. For this work, the selected legths are: 5 m (<xref id="x-3d66ab8ed0ad" rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>), 10 m  (<xref id="x-2411bb712924" rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>), 20 m (<xref id="x-27ce5fd79923" rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>), and 100 m (<xref id="x-714e8f3d6f4f" rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>). <xref id="x-a3b27b0920a6" rid="R253322532192751" ref-type="bibr">Gao et al. (2018)</xref>  did not use fixed length segments but defined test routes with an average length of 250 m. To calculate the segment-wise roughness assessments, the mapped data points are regrouped by specifically created fixed-length segments. This is done by subdividing GIP sections into segments of the given length and combining parts of them where necessary. To map the collected data onto the GIP, the following steps are performed:</p>
          <list list-type="bullet">
            <list-item id="li-3a03d3bc70be">
              <p>First, for each test run, a trajectory consisting of all collected GNSS localizations and their correspond- ing time stamps is extracted.</p>
            </list-item>
            <list-item id="li-f2aea87019d7">
              <p>These are mapped onto the GIP sections using an existing implementation of the approach described by  (<xref id="xref-7c9d9f099edf477997e299823f3ff2f7" rid="R253322532192753" ref-type="bibr">Rehrl et al., 2018</xref>).</p>
            </list-item>
            <list-item id="li-afbc272414bf">
              <p>For each route, the segmentation into fixed length segments is calculated such that any given position on any GIP section along the test route corresponds to exactly one fixed-length segment.</p>
            </list-item>
            <list-item id="li-ef74e52eaadf">
              <p>The section-mapped accelerations are then assigned to the corresponding fixed length segments, yielding a list of measurements per test run and segment.</p>
            </list-item>
          </list>
        </sec>
        <sec>
          <title id="title-c1c22f24112c47989dbfed149d9ab206">Data set</title>
          <p id="paragraph-f69ded8747524907b466016022fe7254">The data used for this study was collected along the test routes depicted in <xref id="x-780cd8d7b08f" rid="f-0a50a92b2e3b" ref-type="fig">Figure 3</xref>. Test riders were instructed to keep a speed of approximately 20 km/h. In total, 72.2 km of test rides were conducted, resulting in 11.3 h of test data covering 4.6 km of cycling infrastructure with different surface types as listed in the figure. Along these routes, 77 118 GNSS localizations and 1.66 million acceleration measurements were collected. 2 691 acceleration measurements are considered to be collected from an unmoving bicycle and used for the calculation of reference vectors for the extraction of the vertical acceleration component as described in section 3.2.1. After filtering by speed as described in section 3.2.2, 834 thousand acceleration measurements, about half of the collected measurements, remained for roughness calculation. Depending on segment length, these were assigned to one of 1 193 segments of 5 m, 595 segments of 10 m, 296 segments of 20 m, or 57 segments of 100 m length. The apparent mismatch with the reported 4.6 km of infrastructure is a consequence of segments driven in both directions being counted once for total assessed infrastructure but twice (once per direction) when creating fixed length segments. The slight mismatch between the different segment lengths is a consequence of the last parts of each test route shorter than the given length being dropped.</p>
        </sec>
      </sec>
      <sec>
        <title id="title-1f161dd1ccbd41088f827a0481c54910">Roughness calculation</title>
        <p id="paragraph-cc0629e6e1d9421caee35a09200e9028">To calculate the roughness based on the accelerations grouped by segments there are some preprocessing steps followed by the actual roughness calculation methods. First, required for all compared methods but the one by <xref id="xref-e156aefc6cad4d0f9dc589ee2700c8e4" rid="R253322532192751" ref-type="bibr">Gao et al. (2018)</xref>, the vertical component needs to be extracted from the three accelerometer axes. Also accelerations are filtered based on speed to clip stop points, slow sections and also GNSS errors resulting in very high speeds and possibly erroneously map matched measurements. Finally, the actual roughness calculation using the reimplementations of the methods selected for comparison is performed. The result is one rating per test segment, sensor and method to be compared subsequently.</p>
        <sec>
          <title id="title-9a1064a3e3d74180a1e5a8bf6caf51ca">Calculation of vertical acceleration component</title>
          <p id="paragraph-a2141a8434c544c99d1fa812d15a63fe">All tested methods except the one proposed by <xref id="xref-e217700d09184c2aa772e391da8c5d6c" rid="R253322532192751" ref-type="bibr">Gao et al. (2018)</xref> use only the vertical acceleration for roughness calculation. As the two sensors were mounted in different orientations, both with no accelerometer axis pointing precisely downwards, vertical accelerations had to be calculated from all three axes. Following <xref id="xref-64bc77446c35430fb4ff3b18ce40f637" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref>, this was done using reference gravitational acceleration vectors collected while the bicycle was standing still. While the authors of <xref id="xref-eb5550948d474562bc6a5eacbcf19bcd" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref> instructed their test riders to keep the bike steady for the first five seconds of each test ride we chose a different approach. We filtered all collected accelerations by two criteria:</p>
          <list list-type="order">
            <list-item id="li-01088057fe14">
              <p>A speed below 10<sup id="superscript-2d1f5ea988cc440491845b2aeb944f47">−</sup><sup id="superscript-48f0d9b174734b8f9f3c6b17b87fb721">3</sup> m/s. This was considered as the bicycle not moving along the test route. The speed was calculated from GNSS data, GNSS sampling at 10 Hz with the localization precision limited by the used GNSS sensors. Given these prerequisites, no lower speeds were observed in the data or, upon further investigation, technically possible.</p>
            </list-item>
            <list-item id="li-1c7238298de5">
              <p>A change in resulting acceleration, i.e. the length of the resulting acceleration vector, of less than 0.1 m/s<sup id="superscript-aa4d16a0ef9c432fb63e3e9f38aa32c5">3</sup>. This prerequisite was chosen to filter out two possible situations: First, the bike not moving along the test route, but lateral rolling, like tilting sideways, or otherwise being slightly moved by mounting, operating the sensors or similar situations. And second, erroneously repeated GNSS localization yielding a speed of zero mid-ride.</p>
            </list-item>
          </list>
          <p id="paragraph-56b323ba005d4149ad125486a690b7d4">The remaining accelerations were grouped by test route and sensor. The filtering by test route was done to account for possible changes in exact mount configurations between the data collected at different times. For each sensor-route pair, a reference vector <inline-formula id="if-bdc6c5797f7c"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mover><mml:mi>r</mml:mi><mml:mo>→</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mfenced><mml:mrow><mml:mover><mml:mi>x</mml:mi><mml:mo>¯</mml:mo></mml:mover><mml:mo>,</mml:mo><mml:mover><mml:mi>y</mml:mi><mml:mo>¯</mml:mo></mml:mover><mml:mo>,</mml:mo><mml:mover><mml:mi>z</mml:mi><mml:mo>¯</mml:mo></mml:mover></mml:mrow></mml:mfenced></mml:math></inline-formula> was chosen by taking the median of the remaining values in x, y and z direction of the accelerometer. The median was chosen to dampen the effect of possibly missed outliers. Subsequently, following <xref id="xref-8d38f71f191b42c8b28542ed0eb37b86" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref>, the vertical component <inline-formula id="if-985a054eaf61"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>a</mml:mi><mml:mi>v</mml:mi></mml:msub></mml:math></inline-formula> of each measured acceleration <inline-formula id="if-4984c9d43f42"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mover><mml:mi>a</mml:mi><mml:mo>→</mml:mo></mml:mover><mml:mo>=</mml:mo><mml:mfenced><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mo> </mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo><mml:mo> </mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>,</mml:mo></mml:mrow></mml:mfenced></mml:math></inline-formula> was derived by projection onto the unit vector in vertical direction using <xref id="x-68510bb309e9" rid="dfg-0108cf85c421" ref-type="disp-formula">Equation 1</xref>.</p>
          <disp-formula-group id="dfg-0108cf85c421"> <disp-formula><label>1</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>a</mml:mi><mml:mi>v</mml:mi></mml:msub><mml:mfenced><mml:mover><mml:mi>a</mml:mi><mml:mo>→</mml:mo></mml:mover></mml:mfenced><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msub><mml:mi>x</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mo> </mml:mo><mml:mover><mml:mi>x</mml:mi><mml:mo>¯</mml:mo></mml:mover><mml:mo> </mml:mo><mml:mo>+</mml:mo><mml:mo> </mml:mo><mml:msub><mml:mi>y</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mo> </mml:mo><mml:mover><mml:mi>y</mml:mi><mml:mo>¯</mml:mo></mml:mover><mml:mo> </mml:mo><mml:mo>+</mml:mo><mml:mo> </mml:mo><mml:msub><mml:mi>z</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>*</mml:mo><mml:mo> </mml:mo><mml:mover><mml:mi>z</mml:mi><mml:mo>¯</mml:mo></mml:mover></mml:mrow><mml:msqrt><mml:msup><mml:mover><mml:mi>x</mml:mi><mml:mo>¯</mml:mo></mml:mover><mml:mn>2</mml:mn></mml:msup><mml:mo> </mml:mo><mml:mo>+</mml:mo><mml:mo> </mml:mo><mml:msup><mml:mover><mml:mi>y</mml:mi><mml:mo>¯</mml:mo></mml:mover><mml:mn>2</mml:mn></mml:msup><mml:mo> </mml:mo><mml:mo>+</mml:mo><mml:mo> </mml:mo><mml:msup><mml:mover><mml:mi>z</mml:mi><mml:mo>¯</mml:mo></mml:mover><mml:mn>2</mml:mn></mml:msup></mml:msqrt></mml:mfrac></mml:math></disp-formula></disp-formula-group>
        </sec>
        <sec>
          <title id="title-f0c82c80c90e49c79a4272ef0675010a">Filtering and smoothing of speeds</title>
          <p id="paragraph-faa6689ec1684b7c8a4d44fe3bd025da">With the vertical acceleration calculated for each measurements, the collected data points were filtered based on their speed. First, any measurements with calculated speeds above 40 km/h were discarded. The remaining speeds were smoothed using a rolling cosine window with a width of five measurements in order to dampen the effect of slight GNSS localization errors. Next, measurements with smoothed speeds below 15 km/h were discarded. Finally, measurements with a change in resulting acceleration since the last measurement above 5 000 m/s<sup id="superscript-e7e8ddd6fc7d4f029f6263ffcd356707">3</sup> were removed, as they are also most likely erroneous. On the one hand, this removes the stops at the beginning and end of each test ride as well as stops at traffic lights and similar. On the other hand, measurements collected at speeds below this threshold are undesirable as speed has a major impact on the measured vertical accelerations. At higher speeds, identical vertical displacements yield higher accelerations as the bike is subjected to them in a shorter period of time  (<xref rid="R253322532192751" ref-type="bibr">Gao et al., 2018</xref>; <xref rid="R253322532192773" ref-type="bibr">Olieman et al., 2012</xref>). Thus measurements should be collected at a constant speed.</p>
        </sec>
        <sec>
          <title id="title-66409dc3cacb4ed0a76162719c4e4757">Methods</title>
          <p id="paragraph-7c178fdfb4754c0a9b033363fbdab127">As justified in section 2, five works and their respective roughness calculation methods were chosen for comparison. In section 2, the general approach of each of these works were outlined. In this section, their roughness calculation methods are described in detail, with an additional focus on the reimplementations used for this work.</p>
          <p id="p-62a9e8e63aa1"><xref id="xref-4690b125cdda4ee693408e19b7c942b3" rid="R253322532192768" ref-type="bibr">Bíl et al. (2015)</xref> calculate their DCI according to <xref id="x-99bc252e34e0" rid="dfg-bd2bd38e230c" ref-type="disp-formula">Equation 2</xref>. <inline-formula id="if-986a6a9ff1b9"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>a</mml:mi><mml:mi>v</mml:mi></mml:msub></mml:math></inline-formula> represents the vertical acceleration. <italic id="e-76f044c85802">n </italic>represents the number of vertical accelerations greater than 1 g in one second. They grouped the accelerations by seconds as this was the sampling frequency of their GNSS sensor, allowing them to calculate one DCI value per GNSS point. As we are only interested in calculating metrics per segment, we grouped the accelerations by segments instead, with <italic id="e-ae27f3009697">n</italic> consequently being the number of accelerations greater than 1 g per segment. The <inline-formula id="if-dadeae655c63"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>a</mml:mi><mml:mrow><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:mrow></mml:msub></mml:math></inline-formula> are therefore exactly these accelerations. The gravitational acceleration of the earth is not the same around the globe. Several standard approximations of 1 g exist, with 9.81 m/s<sup id="superscript-e546ce23386a4ff0b78ba927cb1681d1">2</sup> often used for manual calculations and 9.80665 m/s<sup id="superscript-277a015102314e918f3e2779c057eed1">2</sup> often used in engineering. However, as we calculated separate gravitational reference vectors per route-sensor pair anyways, we used the length of these as values for 1 g. Thanks to their concise description of the roughness calculation, we are confident in our reimplementation of their method.</p>
          <disp-formula-group id="dfg-bd2bd38e230c"> <disp-formula><label>2</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>D</mml:mi><mml:mi>C</mml:mi><mml:mi>I</mml:mi><mml:mo>=</mml:mo><mml:msup><mml:mfenced><mml:mrow><mml:mfrac><mml:mn>1</mml:mn><mml:mi>n</mml:mi></mml:mfrac><mml:munderover><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>n</mml:mi></mml:munderover><mml:msubsup><mml:mi>a</mml:mi><mml:mrow><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:mrow><mml:mn>2</mml:mn></mml:msubsup></mml:mrow></mml:mfenced><mml:mrow><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msup></mml:math></disp-formula></disp-formula-group>
          <p id="paragraph-0555745770a34987b224d70a12554c1f"><xref id="xref-ce1c5430f97d4f01a8f49a40ce945b5e" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref> use double integration of the absolute value of the vertical acceleration <inline-formula id="if-dc7cf723f425"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>a</mml:mi><mml:mi>v</mml:mi></mml:msub></mml:math></inline-formula> divided by the traveled distance S to calculate the vertical displacement according to <xref id="x-56f20c2c190c" rid="dfg-95ac8e69e43b" ref-type="disp-formula">Equation 3</xref>. To this end, t<sub id="subscript-7b3de3d6b3d64eb09a54c348272ad69b">start</sub> and t<sub id="subscript-51de526990ae4d77a119a9ca9eaea56b">stop</sub> are defined as the first and last timestamp of measurements belonging to a given test segment, in the original work of 20 m length. They consider their assessment to closely reflect the IRI  (<xref id="x-67430217aa2e" rid="R253322532192761" ref-type="bibr">Sayers &amp; Karamihas, 1998</xref>), defined as the vertical displacement, typically measured in mm/m. <xref id="x-37748c1268ea" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref>  point out, that distance calculation from speeds and timestamps might be more precise than from localizations. However, as we did not undertake any direct speed measurements, we used the fixed segment length for S. They do not report the integration approximation they used for the discrete acceleration values. We decided to use cumulative trapezoid integration for the inner integral and Simpson's rule for the outer integral. This combination was chosen as it yielded the highest agreement with the other methods and was therefore considered the best possible variant of their described method. With the travel distance calculation done differently and the assumptions on integration, it is difficult to tell how close our results are to ones achieved using their complete system. However, we remain confident that the general concept is reproduced well by our reimplementation.</p>
          <disp-formula-group id="dfg-95ac8e69e43b"> <disp-formula><label>3</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>I</mml:mi><mml:mi>R</mml:mi><mml:mi>I</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:mo>∫</mml:mo><mml:msubsup><mml:mo>∫</mml:mo><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mi>t</mml:mi><mml:mi>a</mml:mi><mml:mi>r</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:msub><mml:msub><mml:mi>t</mml:mi><mml:mrow><mml:mi>s</mml:mi><mml:mi>t</mml:mi><mml:mi>o</mml:mi><mml:mi>p</mml:mi></mml:mrow></mml:msub></mml:msubsup><mml:mfenced close="|" open="|"><mml:msub><mml:mi>a</mml:mi><mml:mi>v</mml:mi></mml:msub></mml:mfenced><mml:msup><mml:mfenced><mml:mrow><mml:mi>d</mml:mi><mml:mi>t</mml:mi></mml:mrow></mml:mfenced><mml:mn>2</mml:mn></mml:msup></mml:mrow><mml:mi>S</mml:mi></mml:mfrac></mml:math></disp-formula></disp-formula-group>
          <p id="paragraph-fee6fd41ee684686884591467573105e">Of the methods compared in this work, the approach by <xref id="xref-1571b608c30642a98e8d0bb072641e9b" rid="R253322532192751" ref-type="bibr">Gao et al. (2018)</xref> is the only one taking all accelerometer axes into account beyond extracting vertical acceleration. Their approach is based on ISO 2631-1  (<xref id="xref-244fcd9873a44cbf8f39f11f023ffa09" rid="R253322532192776" ref-type="bibr">ISO, 1997</xref>), therefore they report to weight the accelerations in the frequency domain. Unfortunately, they do not report the corresponding weighting parameters. Therefore, this work uses unweighted accelerations for the reimplementation. It may be discussed, whether this still allows for comparability of the method, however the information Gao et al. provided did not allow for a more detailed reimplementation. For each accelerometer axis,<xref id="x-7894abb7b9a4" rid="R253322532192751" ref-type="bibr">Gao et al. (2018)</xref> calculate the root mean square value (<xref id="x-3fc92493228d" rid="dfg-9c6a2fb7405e" ref-type="disp-formula">Equation 4</xref>). Subsequently, the euclidean norm of the 3D vector consisting of these root mean square values per axis a<sub id="subscript-34a275d2039d44aa8902bb3fa9a375da">i</sub> is calculated for the resulting acceleration value <xref id="x-b5158c19cd9d" rid="dfg-389417e4b01f" ref-type="disp-formula">Equation 5</xref>. This equates to calculating the resulting vector from three root mean square average vectors along the accelerometer axes and is actually the approach recommended by ISO 2631-1.</p>
          <table-wrap id="table-wrap-976bc14dff6a401d87a1ab7dd903465c" orientation="portrait">
            <label>Table 1</label>
            <caption id="caption-1f5dbc81494449a1ad4d082b8000f372">
              <title id="title-f2f55167f5db448db4df82a78a48cbba">Expected perception depending on the vertical accelerations root mean square per segment as reported by <xref id="x-706cd9683500" rid="R253322532192765" ref-type="bibr">Nuñez et al. (2020)</xref> </title>
            </caption>
            <table id="table-d9e781dff07c43bfbee90c09e8b1fcdc" rules="rows">
              <colgroup>
                <col width="14"/>
                <col width="13.879999999999999"/>
                <col width="14"/>
                <col width="15.190000000000001"/>
                <col width="14.71"/>
                <col width="13.999999999999998"/>
                <col width="14.22"/>
              </colgroup>
              <thead id="table-section-header-2f501737478e">
                <tr id="tr-a286bffb21fc">
                  <th id="tc-ec7f714a8526" align="left">
                    <p id="p-61ffbb524f16"> RMS   in m/s<sup id="s-ac2b49b2e62e">2</sup></p>
                  </th>
                  <th id="tc-65e82b57939f" align="center">
                    <p id="p-f41809407514">&lt; 0.315</p>
                  </th>
                  <th id="tc-30faf348f600" align="center">
                    <p id="p-f9fc6b10c483">0.315 – 0.63</p>
                  </th>
                  <th id="tc-8845ded74fc4" align="center">
                    <p id="p-3dc770cdfee8">0.63 – 1</p>
                  </th>
                  <th id="tc-48bdc6f32cd7" align="center">
                    <p id="p-eeff6da7f6cd">1 – 1.6</p>
                  </th>
                  <th id="tc-438edc42a65c" align="center">
                    <p id="p-d6fa61121f5f">1.6 – 2.5</p>
                  </th>
                  <th id="tc-7c056503dd7b" align="center">
                    <p id="p-3731f503dd3a">&gt; 2.5</p>
                  </th>
                </tr>
              </thead>
              <tbody id="table-section-39a8463e2d67414bb020b16b1e7316e8">
                <tr id="table-row-2582ea42f57e49678bffc3803c2376a7">
                  <td id="table-cell-1dd87e382cbc494fbe3faf46ba4bd578" align="left">
                    <p id="paragraph-5d7dc4fee5394b6a82d251fe95dd2c27"> Expected perception</p>
                  </td>
                  <td id="table-cell-75187ba19280472992c18aadab7b44d3" align="center">
                    <p id="paragraph-7aa7265a9832402d8ed5d1d35f8dcbc7"> Not uncomfortable</p>
                  </td>
                  <td id="table-cell-37abe5b6bfc144e6b383944902c787c0" align="center">
                    <p id="paragraph-e8911a6b82c647e3aafb12bf59aec2e9"> A little uncomfortable</p>
                  </td>
                  <td id="table-cell-55a641554bac43f4a682f8d3847a9423" align="center">
                    <p id="paragraph-16b1cc5df6e54e87912d60868b2f64bd"> Fairly uncomfortable</p>
                  </td>
                  <td id="table-cell-e020dc7ec9e94e0f87c342b9fe5b31bb" align="center">
                    <p id="paragraph-ce920558996547948643fd1633b08be9">Uncomfortable</p>
                  </td>
                  <td id="table-cell-3f13b12a4c724384aa89d06bdff0891a" align="center">
                    <p id="paragraph-3a8feb57f227452c83d19d43e89748f3"> Very uncomfortable</p>
                  </td>
                  <td id="table-cell-0fc2493548f04485b324f96d4730185b" align="center">
                    <p id="paragraph-180407b840584a50aba3089ea8d925d8"> Extremely uncomfortable</p>
                  </td>
                </tr>
              </tbody>
            </table>
          </table-wrap>
          <p id="paragraph-453dd9562617497091cee5f973999894"/>
          <disp-formula-group id="dfg-9c6a2fb7405e"> <disp-formula><label>4</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>a</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msqrt><mml:mfrac><mml:mn>1</mml:mn><mml:mi>T</mml:mi></mml:mfrac><mml:munderover><mml:mo>∑</mml:mo><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mn>0</mml:mn></mml:mrow><mml:mrow><mml:mi>t</mml:mi><mml:mo>=</mml:mo><mml:mi>T</mml:mi></mml:mrow></mml:munderover><mml:msubsup><mml:mi>a</mml:mi><mml:mi>i</mml:mi><mml:mn>2</mml:mn></mml:msubsup><mml:mfenced><mml:mi>t</mml:mi></mml:mfenced></mml:msqrt><mml:mo> </mml:mo><mml:mo> </mml:mo><mml:mi>i</mml:mi><mml:mo>∈</mml:mo><mml:mi>x</mml:mi><mml:mo>,</mml:mo><mml:mi>y</mml:mi><mml:mo>,</mml:mo><mml:mi>z</mml:mi></mml:math></disp-formula></disp-formula-group>
          <disp-formula-group id="dfg-389417e4b01f"> <disp-formula><label>5</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>a</mml:mi><mml:mi>v</mml:mi></mml:msub><mml:mo>=</mml:mo><mml:msqrt><mml:msubsup><mml:mi>a</mml:mi><mml:mi>x</mml:mi><mml:mn>2</mml:mn></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mi>a</mml:mi><mml:mi>y</mml:mi><mml:mn>2</mml:mn></mml:msubsup><mml:mo>+</mml:mo><mml:msubsup><mml:mi>a</mml:mi><mml:mi>z</mml:mi><mml:mn>2</mml:mn></mml:msubsup></mml:msqrt></mml:math></disp-formula></disp-formula-group>
          <p id="paragraph-dc3e8631224c4114b4e7360153c5a054"><xref id="xref-e577a1c00e234e7aa2eb9d27518889e0" rid="R253322532192765" ref-type="bibr">Nuñez et al. (2020)</xref> use a straightforward root mean square (RMS) of vertical acceleration <inline-formula id="if-40d74f723a37"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>a</mml:mi><mml:mi>v</mml:mi></mml:msub></mml:math></inline-formula> as can be seen in <xref id="x-d85d87830ac2" rid="dfg-ac6d4700f665" ref-type="disp-formula">Equation 6</xref>. They report calculating this RMS per 5 m section. This work uses the same approach only testing different segment lengths. Notably, <xref id="x-dd2bc686a31f" rid="R253322532192765" ref-type="bibr">Nuñez et al. (2020)</xref>  do not consider this RMS the final assessment but based on ISO 2631-1  (<xref id="xref-c3fb84b099734d07af937a2f684e1ef6" rid="R253322532192776" ref-type="bibr">ISO, 1997</xref>) they define six vibrations classes as listed in <xref id="x-ab8881abf72c" rid="table-wrap-976bc14dff6a401d87a1ab7dd903465c" ref-type="table">Table 1</xref>. For their BEQI, they take additional factors like traffic density, bicycle parking or accessibility into account. However, as this work is focused on surface quality assessment only the RMS-based roughness rank is used for comparison.</p>
          <disp-formula-group id="dfg-ac6d4700f665"> <disp-formula><label>6</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>R</mml:mi><mml:mi>M</mml:mi><mml:mi>S</mml:mi><mml:mo>=</mml:mo><mml:msqrt><mml:mfrac><mml:mn>1</mml:mn><mml:mi>N</mml:mi></mml:mfrac><mml:munderover><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mi>N</mml:mi></mml:munderover><mml:msubsup><mml:mi>a</mml:mi><mml:mrow><mml:mi>v</mml:mi><mml:mi>i</mml:mi></mml:mrow><mml:mn>2</mml:mn></mml:msubsup></mml:msqrt></mml:math></disp-formula></disp-formula-group>
          <p id="paragraph-e9d350712ed64aef99495845fb39b6df"><xref id="xref-4063c61a040440de88928d887e58a1db" rid="R253322532192775" ref-type="bibr">Kranzinger and Leitinger (2021)</xref> are the only authors not working directly with the accelerations but their first order derivative, the so-called jerks. Furthermore, unlike the works by <xref id="x-e29c67bd955a" rid="R253322532192768" ref-type="bibr">Bíl et al. (2015)</xref>, <xref id="x-e67ed719537c" rid="R253322532192751" ref-type="bibr">Gao et al. (2018)</xref> and <xref id="x-a4b34b1e7a42" rid="R253322532192765" ref-type="bibr">Nuñez et al. (2020)</xref>, they do not use any variant of the root mean square of the derived values. Instead, they first calculate two parameters for each section:</p>
          <list list-type="order">
            <list-item id="li-a55e3c46a6ac">
              <p>The average jerk value per segment, measured in m/s<sup id="superscript-4435b174220a4b2aaa4e3c2e1b37f00b">3</sup>.</p>
            </list-item>
            <list-item id="li-bfd554da16ea">
              <p>The share of heavy jerks per segment. In their original work they considered a ‘heavy jerk’ any jerk above 1 200 m/s<sup id="superscript-89f4d3a750b14e85a8a8ea5dc4cbcf2a">3</sup>. As this value is highly dependent on the used bicycle, IMU, tire pressure, mounting position and solution, and so on, it is not used in this work. Instead a new threshold is defined, using the 70<sup id="superscript-6accc6f68ece4020a456a4670122948f">th</sup>-percentile of jerk values per sensor. The 70<sup id="superscript-44cd29d8651d41129962fb6f467a4474">th</sup>-percentile was chosen with the clustering in mind. As it yielded a wide spread of heavy jerk shares, a truly two-dimensional clustering was possible.</p>
            </list-item>
          </list>
          <p id="paragraph-bb9f46e79545419f818edfe0e1bfbfd3">The resulting average jerk value and heavy jerk share per section are then used as input to a k-means++ clustering algorithm  (<xref id="xref-1ec256149335454abab4b39431f9c23f" rid="R253322532192766" ref-type="bibr">Arthur &amp; Vassilvitskii, 2007</xref>). The number of clusters is fixed to four, and 20 runs with differing cluster center initializations are performed. The best resulting cluster centers are kept for later clustering. Apart from the changed heavy jerk threshold, we consider this implementation a very close reimplementation of the described method.</p>
        </sec>
      </sec>
      <sec>
        <title id="title-7380db93bfec4320a2c03df9da3f7e7b">Comparison of results</title>
        <p id="paragraph-9eec700560b540a295a425bbcf501664">The stated goal of this work as outlined in section 1 is to compare different sensor-method combinations and segment lengths used for surface roughness estimation. To this end, mathematically sound comparison methods are required. While numerical assessments are fairly easy to compare assuming that the assessments are normalized first, the mixture of numerical and ordinal assessment scales proved difficult to compare. For stability, no possibility of comparing numerical and ordinal assessment methods directly was found. Comparing the numerical assessments to each other was straightforward using the coefficient of variation. Comparing ordinal assessments is difficult, as one must not make the mistake of considering classes equally spaced. However, a suited measure for ordinal dispersion was found that allows for comparison of stability between ordinal methods and their combination with different sensors and segment lengths. Regarding the reliability, defined as the agreement of any given method with an ensemble of all considered methods, rank correlation was used to be able to compare all used methods to each other.</p>
        <sec>
          <title id="title-d44eef0dc32a49cdb47b81bdfaa37038">Stability calculation</title>
          <p id="paragraph-6b35e6b0df164aaea32b482412e52b1c">The stability of assessments using a certain sensor-method combination is a metric whether applying this combination to the same test route multiple times will yield the same or differing results. This is of interest mainly to decide how many test runs of a certain sensor-method calculation are necessary to get a result that additional test runs are unlikely to change much. Therefore, the average dispersion over all test segments is used as stability measure. Unfortunately, to the best of the authors' knowledge, there is no common dispersion metric for numerical and ordinal scales. Thus, two different metrics had to be adopted, making results for ordinal and numerical methods incomparable. However, dispersion between different numerical or ordinal methods as well as between sensors combined with the same method are still possible.</p>
          <p id="paragraph-be7616109c0441228c5aa72fbe582209">For numerical assessment methods, the coefficient of variation <italic id="e-ac5a78960205">CV</italic> is used as dispersion metric. As shown in <xref id="x-5ce53c9631bc" rid="dfg-8f032c7ddc6c" ref-type="disp-formula">Equation 7</xref>, it is calculated as the ratio of the standard deviation <inline-formula id="if-2d36f1bcc504"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>δ</mml:mi></mml:math></inline-formula> to the mean <inline-formula id="if-43b4a9ca7815"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>μ</mml:mi></mml:math></inline-formula> of the ratings per test segment. Also called the Normalized Root-Mean Square Deviation or relative standard deviation, it is a widely used standardized dispersion measure.</p>
          <disp-formula-group id="dfg-8f032c7ddc6c"> <disp-formula><label>7</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>C</mml:mi><mml:mi>V</mml:mi><mml:mo>=</mml:mo><mml:mfrac><mml:mi>δ</mml:mi><mml:mi>μ</mml:mi></mml:mfrac></mml:math></disp-formula></disp-formula-group>
          <p id="paragraph-7d50dbcc5340408a910d819114826705">For ordinal assessment methods, ordinal dispersion as defined by <xref id="x-5561a328c96e" rid="R253322532365088" ref-type="bibr">Blair and Lacy (2000)</xref>  is used. <xref id="x-e9132810f5d4" rid="dfg-8f032c7ddc6c" ref-type="disp-formula">Equation 7</xref> shows the calculation of <inline-formula id="if-dfbb73627a10"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msup><mml:mi>l</mml:mi><mml:mn>2</mml:mn></mml:msup></mml:math></inline-formula>, a measure of concentration. For dispersion, the inverse <inline-formula id="if-ac305ffd0403"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:msup><mml:mi>l</mml:mi><mml:mn>2</mml:mn></mml:msup></mml:math></inline-formula> is used. In the formula, <inline-formula id="if-a80a84971b28"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>F</mml:mi><mml:mi>i</mml:mi></mml:msub></mml:math></inline-formula> denotes the cumulative frequency, or sum of its own frequency and that of all lower classes, of the <italic id="e-f1e1d8d29d94">i</italic>-th class. From cumulative frequencies of all k classes but that of the highest class, 0.5 is subtracted, the result squared and these values summed. The highest class' cumulative frequency is dropped as it is always 1. Subtracting 0.5 yields a metric independent of the center of the distribution: if most samples are rated low, a cumulative frequency of 0.5 is reached fast, yielding increasing positive values for <inline-formula id="if-0217d3c5f8c3"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>F</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mo> </mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac></mml:math></inline-formula>. If most samples are rated high, a cumulative frequency of 0.5 is reached slowly, yielding relatively high negative values for <inline-formula id="if-e5a70b4dd534"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msub><mml:mi>F</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac></mml:math></inline-formula>. Squared these become equal, yielding a method only dependent on the width, not the center of the distribution. <inline-formula id="if-e79c57b9fa49"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mfrac><mml:mrow><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mn>4</mml:mn></mml:mfrac></mml:math></inline-formula> is used to get a normed metric. It is the highest possible value of the numerator for an ordinal metric with k possible classes. The highest possible concentration would mean all values in exactly one class. The numerator would then be a sum <inline-formula id="if-163de4f532a4"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:math></inline-formula>-times either <inline-formula id="if-b01dd96a1991"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msup><mml:mfenced><mml:mrow><mml:mn>0</mml:mn><mml:mo>-</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac></mml:mrow></mml:mfenced><mml:mn>2</mml:mn></mml:msup></mml:math></inline-formula> or <inline-formula id="if-dc96d6d7155a"> <mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msup><mml:mfenced><mml:mrow><mml:mn>1</mml:mn><mml:mo>-</mml:mo><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac></mml:mrow></mml:mfenced><mml:mn>2</mml:mn></mml:msup></mml:math></inline-formula>, exactly the value of the denominator.</p>
          <disp-formula-group id="dfg-279f8f3903d7"> <disp-formula><label>8</label><mml:math xmlns:mml="http://www.w3.org/1998/Math/MathML"><mml:msup><mml:mi>l</mml:mi><mml:mn>2</mml:mn></mml:msup><mml:mo>=</mml:mo><mml:mfrac><mml:mrow><mml:msubsup><mml:mo>∑</mml:mo><mml:mrow><mml:mi>i</mml:mi><mml:mo>=</mml:mo><mml:mn>1</mml:mn></mml:mrow><mml:mrow><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:msubsup><mml:msup><mml:mfenced><mml:mrow><mml:msub><mml:mi>F</mml:mi><mml:mi>i</mml:mi></mml:msub><mml:mo>-</mml:mo><mml:mstyle displaystyle="true"><mml:mfrac><mml:mn>1</mml:mn><mml:mn>2</mml:mn></mml:mfrac></mml:mstyle></mml:mrow></mml:mfenced><mml:mn>2</mml:mn></mml:msup></mml:mrow><mml:mstyle displaystyle="true"><mml:mfrac><mml:mfenced><mml:mrow><mml:mi>k</mml:mi><mml:mo>-</mml:mo><mml:mn>1</mml:mn></mml:mrow></mml:mfenced><mml:mn>4</mml:mn></mml:mfrac></mml:mstyle></mml:mfrac></mml:math></disp-formula></disp-formula-group>
        </sec>
        <sec>
          <title id="title-54aed5fae44d4cc6b1fa67b78483196e">Reliability calculation</title>
          <p id="paragraph-c553a50fd23245368b63d0834ef0c064">The reliability of assessments made using a certain sensor-method combination is a metric whether applying this combination to a certain test route is likely to produce similar results as applying an ensemble of other combinations to the same test routes. This is of interest to decide whether one or the other method is more likely to produce a ‘realistic’ assessment of a given test route, assuming that an ensemble of established methods should produce a good assessment. As absolute values of different numerical and ordinal scales are hard to compare, relative assessments are compared instead. To this end, Spearman's rank correlation coefficient  (<xref id="x-47becda63cdc" rid="R253322532329955" ref-type="bibr">Corder &amp; Foreman, 2014</xref>)  is calculated as follows:</p>
          <list list-type="order">
            <list-item id="li-79b9d2155fb0">
              <p>First, for each predefined segment length, the assessments per sensor-method combinations are normalized.</p>
            </list-item>
            <list-item id="li-fe1933037da8">
              <p>Next, an average assessment of all sensor-method combinations is calculated for each segment.</p>
            </list-item>
            <list-item id="li-fc81749580df">
              <p>Third, both the assessments per sensor-method combination as well as the average are ranked.</p>
            </list-item>
            <list-item id="li-123f077a934b">
              <p>Last, Spearman's rank correlation is calculated between each sensor-method combination, all other combinations, and the average of all methods.</p>
            </list-item>
          </list>
          <p id="paragraph-03741aea437c48ec9ea3f495a14e0ba4">The resulting values represent the degree to which any given sensor-method combination will rank the test segments surface roughness in a similar order compared to the other methods and the ensembles average. The underlying assumption is that, while the absolute ratings of any method might depend on a multitude of factors, the ranking of segments should be comparable for any method considered to be able to produce a good assessment of surface roughness.</p>
        </sec>
      </sec>
    </sec>
    <sec>
      <title id="title-9491350d0660436ea408bc6485549366">Results</title>
      <p id="paragraph-0b6b45f7e17145fd941ab0d78ca397af">The surface roughness assessments of different combinations of sensors, calculation methods, and segment lengths were compared using the methods described at the end of the previous section. The results are presented subsequently, starting with the stability of each method across the two sensors and four segment lengths. Next the reliability, defined as the agreement with the other methods and an averaged assessment of all methods, measured using Spearman's rank correlation coefficient, is discussed. Finally, some qualitative examples are presented, and the findings summarized and interpreted.</p>
      <sec>
        <title id="title-8187a84ccf1c461d94b0bebc182731d0">Stability</title>
        <p id="paragraph-0e9fa39da1d6418aa01e1fe85fc64cdf">As pointed out in section 3 the stability of numerical and ordinal assessment methods are not directly comparable. As to the best of the authors knowledge, no dispersion metric applicable to both types of scales exists. However, to portray the effects of sensor and segment length, they are still displayed side by side in <xref id="x-dfdc6a13a6f8" rid="f-ba0585b187fe" ref-type="fig">Figure 5</xref>. At the top of each of the four box plots, the method by <xref id="xref-24a0e7e04b854dcb8428ba0fee689d6e" rid="R253322532192775" ref-type="bibr">Kranzinger and Leitinger (2021)</xref>, denoted ‘Kranz.’, and the BEQI  (<xref id="xref-ce04b9e6d8754f87ad052d37be3692a1" rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>), the two ordinal methods, are shown. As described, the other three methods yield numerical assessments. As can be seen, the used method has the biggest impact on dispersion, consistent across sensor type and segment length.</p>
        <fig id="f-ba0585b187fe" orientation="portrait" fig-type="graphic" position="anchor">
          <label>Figure 5 </label>
          <caption id="c-4874f25ae63a">
            <title id="t-a9241392f7eb">Distribution of the dispersion per segment with respect to sensor type, roughness calculation method combination and segment length (the grey highlighting marks the two ordinal methods)</title>
          </caption>
          <graphic id="g-6f3bd3b0478d" xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/64b52bd0-d370-4f84-b9c7-b8f0eede1e4c-u5_beeking-without-yellow.png"/>
        </fig>
        <p id="paragraph-276006ca4f47496f8c8ad743ea14ade0">Of the ordinal methods, the BEQI is more stable and its dispersion is less impacted by sensor type than the assessment using the method by Kranzinger and Leitinger. Notably, the method by Kranzinger and Leitinger appears to be the least impacted by chosen segment length. Of the numerical methods, the DCI shows exceptionally stable results across both sensors and all segment lengths. The IRI, calculated as described by <xref id="xref-a4df5e6c8b8447b796fd2f7267fb813d" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref> shows a very high dispersion, indicating highly unstable results across test runs. The DCC is somewhat in between, without particularly stable or unstable results. However, among the numerical methods it is least impacted by segment length.</p>
        <p id="paragraph-5d3849d1512a43d78992ec5ad2397ec1">Next, the segment length has a major influence on dispersion. Unsurprisingly, longer segments tend to yield more stable results, except for the IRI calculation as described by <xref id="xref-6dd6110bb76c4a82b3f4b6a88562ebd7" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref>. This was to be expected, as the longer the section is, the smaller is the impact of little differences in the actually driven path. For short segments, even single swerves hitting or avoiding any unevenness have a major impact. This is a disadvantage of any IMU-based method: only the surface directly beneath the tires can be assessed. Notably, as there are about 20 times less segments with a length of 100 m than with a length of 5 m, there are also a lot less outliers for longer segments.</p>
        <p id="paragraph-4ded894cf554485b9caad3b3d04c5886">For shorter segments, the assessments made using the XSens are in general less stable than the ones using a smartphone, with the exception of the combination with the DCI. This also indicates a bigger dispersion in the measured accelerations themselves. However, for longer segments, this effect disappears, likely due to the described ‘smoothing’ effect of adding more data points.</p>
        <p id="paragraph-8e5b675cbe4146499aba168c2107fa7d">In conclusion, the most stable numerical assessments are achievable using the combination of dedicated industrial-grade IMU and the DCI method by <xref id="xref-d319abd81cb64412b9b9e69cdd52b9b7" rid="R253322532192768" ref-type="bibr">Bíl et al. (2015)</xref>. However, especially when using the DCI, the effect of the used sensor is neglectable. The least stable assessments are produced when calculating the IRI as described by <xref id="xref-e433b53864b241879d8ded115d70b191" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref> based on measurements by the dedicated IMU. Of the ordinal methods, the BEQI is more stable, especially for shorter segments and when using the dedicated IMU for acceleration measurement.</p>
      </sec>
      <sec>
        <title id="title-55582c928f3a466b84014682e09aab73">Reliability</title>
        <p id="paragraph-90a0780e039a4f30a2e0a3ee1290a341">As described in section 3 the reliability of each method is defined as its agreement with the other methods, as well as an assessment averaged from all considered methods. Rank correlation is used to be able to properly compare numerical and ordinal assessment methods. All sensor-method combinations are correlated against each other as well as against the average assessment calculated as described in section 3. The correlations are depicted for the different segment lengths in <xref id="x-0e15e3ecbefc" rid="f-7f69d05bfdd3" ref-type="fig">Figure 6</xref> to <xref id="x-61f1540958e5" rid="f-9a57f05479da" ref-type="fig">Figure 9</xref>. The resulting reliability metric ranges between 1 for total accordance, as seen in the correlation of each combination with itself; and -1 for exact opposite ranking of sections.</p>
        <p id="paragraph-60da222e801a4a2c8ee24ea8c01bdd0b">Notably, the DCI  (<xref id="xref-11e8fbb76a8e40799b3419d55af8e35a" rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>), DCC  (<xref id="xref-5a0cb41a74574099897b5e3d804a30b5" rid="R253322532192751" ref-type="bibr">Gao et al., 2018</xref>), and BEQI  (<xref id="xref-c447a3c05aeb4fedacff686bebd71454" rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>) are all highly correlated. This is a result of them all being based on the root mean square of accelerations. The DCI only uses downward accelerations &gt; 1 g, the DCC takes accelerations from all accelerometer axes and the BEQI projects the results onto an ordinal scale. But beyond that, they all use a root mean square of accelerations for vibration assessment as recommended by ISO 2631-1  (<xref id="xref-3151ee37240847e9887ac208b60f6bd7" rid="R253322532192776" ref-type="bibr">ISO, 1997</xref>).</p>
        <fig id="f-7f69d05bfdd3" orientation="portrait" fig-type="graphic" position="anchor">
          <label>Figure 6 </label>
          <caption id="c-b0f6906e1e72">
            <title id="t-0b2121a8868c">Spearman's rank correlation coefficients matrix for a segment length of 5 m</title>
          </caption>
          <graphic id="g-b883b529bde9" xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/71f4b996-12c9-4bfe-90cc-6809911fa53c-ua_beeking.png"/>
        </fig>
        <fig id="f-8c566baf01a6" orientation="portrait" fig-type="graphic" position="anchor">
          <label>Figure 7 </label>
          <caption id="c-d6e909e65cf2">
            <title id="t-f8d988a3a70d">Spearman's rank correlation coefficients matrix for a segment length of 10 m</title>
          </caption>
          <graphic id="g-9a806524e890" xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/f594e8d0-0dd2-4ae9-9215-a3380748096d-ub_beeking.png"/>
        </fig>
        <fig id="f-2cd9781d812b" orientation="portrait" fig-type="graphic" position="anchor">
          <label>Figure 8 </label>
          <caption id="c-b55e3c0455bb">
            <title id="t-ac750ea91dfc">Spearman's rank correlation coefficients matrix for a segment length of 20 m</title>
          </caption>
          <graphic id="g-4c56591db152" xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/cba17bc8-adb3-4005-9f6c-8c43a6128d7f-uc_beeking.png"/>
        </fig>
        <fig id="f-9a57f05479da" orientation="portrait" fig-type="graphic" position="anchor">
          <label>Figure 9 </label>
          <caption id="c-de61288b10ba">
            <title id="t-fbb07aa700f4">Spearman's rank correlation coefficients matrix for a segment length of 100 m</title>
          </caption>
          <graphic id="g-9a012286da67" xlink:href="https://typeset-prod-media-server.s3.amazonaws.com/article_uploads/9098b204-d7db-48a4-98be-95290089f3fc/image/e112b0f1-2222-4274-8e98-72b8f7801005-ud_beeking.png"/>
        </fig>
        <p id="p-60693069784f">This yields a certain problem for taking the correlation with the averaged assessment as a reliability measure, as these three methods might appear disproportionately reliable. For future applications of this comparison approach, this might be alleviated by a more diverse selection of methods. For this work, the presented correlations need to be interpreted with due care.</p>
        <p id="paragraph-483e0d92a2a641999ecdf21bb8b0bb9c">Concerning the effect of segment length, longer segments yield a generally higher correlation of different methods. Especially between the 5 m segments and the 10 m segments, correlation improved for the majority of pairwise compared sensor-method combinations. Increasing the length further yielded small gains for 20 m segments, and somewhat inconclusive results for 100 m segments. Contrastingly, the correlation of the IRI, calculated as described by <xref id="xref-a0e15380ad9042f29a530119d208e915" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref>, decreased with segment length. This indicates some kind of difficulty this method encounters with longer segments.</p>
        <p id="paragraph-9ac6b880c4b64d4cadd2581b67de5a9f">As with stability, the impact of the two different sensors decreases with segment length. Also, it is again highly dependent on the used roughness calculation method. The three aforementioned, root mean square based, methods show high correlations with themselves and towards each other for both sensors used. The IRI ranking is less correlated to the IRI ranking using the other sensor, however still more than to any other method. In general, the IRI is actually negatively correlated to all other methods, indicating some major problems of either the method or the used reimplementation.</p>
        <p id="paragraph-20d1c3ac9c184d60b2b139e12207677b">The method by Kranzinger and Leitinger is most interestingly stronger correlated to other methods than to itself using another sensor, especially for longer segments. Its overall correlation to other methods is rather weak, although increasing with segment length. However, as mentioned three of the methods are based on the same fundamental concept of a root mean square of accelerations. The method by Kranzinger and Leitinger on the other hand is based on a clustering by mean jerks and share of heavy jerks. Jerks are defined as the derivative of the measured vertical acceleration. Presumably, this approaches' sensibility to few very high accelerations is different from the root mean square based approaches. Therefore, comparison to another method considered sound in itself would be required to achieve a trustworthy estimation of its actual reliability for surface roughness assessment. Interestingly, this method is hardly correlated to itself using a different sensor. Presumably, this is the result of the different heavy jerk thresholds and cluster centers calculated for the two sensors.</p>
      </sec>
      <sec>
        <title id="title-a636b9a7c0e04fd89a09b28fea83aa86">Qualitative example</title>
        <p id="paragraph-5a8ad678d4134acc8dbb28764be5f392">As a minimal qualitative example of results, <xref id="x-2c347b2820f0" rid="table-wrap-07d9a8f6d7f542c580d6007679daa2de" ref-type="table">Table 2</xref> lists the ranks of all methods for the on average best and worst assessed segments per segment length and used sensor. As can be seen, the agreement of methods for XSens measurements on these segments is generally high, especially for the good sections. Interestingly, the agreement decreases with segment length. As with stability and reliability, the IRI does not fare too well, again worse for longer segments. The ranks of the two ordinal methods in the last two columns are expected to be less clear, as for all segments in one class, an average rank is used. This also yields the 0.5 ranks.</p>
        <table-wrap id="table-wrap-07d9a8f6d7f542c580d6007679daa2de" orientation="portrait">
          <label>Table 2</label>
          <caption id="caption-04aa287ab2b14d34abe139d1ebb83d96">
            <title id="title-318b1330c7c84dcf922ff2a1b689077e">Ranks of the on average best and worst sections, calculated as described in 3.3.2, per segment length and sensor for all methods</title>
          </caption>
          <table id="table-7ac88302c7174559ac740fe3c2ee7cb4" rules="rows">
            <colgroup>
              <col width="8.68"/>
              <col width="10.45"/>
              <col width="14.479999999999999"/>
              <col width="11.9"/>
              <col width="12.93"/>
              <col width="13.56"/>
              <col width="13.35"/>
              <col width="14.65"/>
            </colgroup>
            <thead id="table-section-header-dbfcf29e55f1">
              <tr id="tr-abef0d400e3a">
                <th id="tc-a126fd6e314b" align="left">
                  <p id="p-ef8ddaa5f609">Segment length</p>
                </th>
                <th id="tc-cb3a15e9ab63" align="center">
                  <p id="p-ee01b04a3f7e">Sensor</p>
                </th>
                <th id="tc-ddadf49628b8" align="center">
                  <p id="p-4dd5fda5387f">Average rank</p>
                </th>
                <th id="tc-0c8850071c3f" align="center">
                  <p id="p-0ccd54f35037">DCI rank</p>
                  <p id="p-f5f3349dd5b2"> (<xref id="x-765a9ca90aca" rid="R253322532192768" ref-type="bibr">Bíl et al., 2015</xref>) </p>
                </th>
                <th id="tc-5ad7afc43552" align="center">
                  <p id="p-4e3b5ba30eda">IRI rank</p>
                  <p id="p-0b5911e0cba4"> (<xref id="x-6c25d1174298" rid="R253322532192745" ref-type="bibr">Zang et al., 2018</xref>) </p>
                </th>
                <th id="tc-007ab63bfded" align="center">
                  <p id="p-f059efc8b7da">DCC rank</p>
                  <p id="p-16bfcf27e369"> (<xref id="x-a9491b9eb3b5" rid="R253322532192751" ref-type="bibr">Gao et al., 2018</xref>) </p>
                </th>
                <th id="tc-7c8baf3891e2" align="center">
                  <p id="p-fcb24c1db298">BEQI rank</p>
                  <p id="p-07fac5accdeb"> (<xref id="x-46115d7ee4c6" rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>) </p>
                </th>
                <th id="tc-0b58463db4f3" align="center">
                  <p id="p-d510d7f5d121">Kranz. rank</p>
                  <p id="p-dd834700c8a0"> (<xref id="x-121d3e3eea84" rid="R253322532192775" ref-type="bibr">Kranzinger &amp; Leitinger, 2021</xref>) </p>
                </th>
              </tr>
            </thead>
            <tbody id="table-section-f2af8480bcb8425d8f5d98779e4285b6">
              <tr id="table-row-e66167e6b8714fd2b82665b7533afa48">
                <td id="table-cell-c49a8040d8cb452b8929d5cbeeaefe2a" align="left">
                  <p id="paragraph-b73fe6a1b1364b7c9e3589b41ea57c99"> 5</p>
                </td>
                <td id="table-cell-f94a9f721b804609985f6aaf21e508c1" align="center">
                  <p id="paragraph-65a4fd8ff4b741edb0a5a7118ffafad5"> Smartphone</p>
                </td>
                <td id="table-cell-237d9d0a8f2c4129a6a8c5dd5516df28" align="right">
                  <p id="paragraph-672a46dfb17d4e6986bdb33bf9e1f7fd"> 1.0 (Best)</p>
                </td>
                <td id="table-cell-8b36f9e33989444384d76ec3faaa74c5" align="right">
                  <p id="paragraph-63f4c7f7451e431a8eb9eba070898e58"> 328.0</p>
                </td>
                <td id="table-cell-2d42a472d7e3464abb619dc905f33309" align="right">
                  <p id="paragraph-97bd2543bad5451a883cd6ac93a9ddf6"> 37.0</p>
                </td>
                <td id="table-cell-3c5a78d7aa3149bcb962518ae1d4d8ff" align="right">
                  <p id="paragraph-a26709d7f4fc4f26918cc6e5b1f15e1f"> 158.0</p>
                </td>
                <td id="table-cell-12491f37cc154fae8c45b6bf8fd608f2" align="right">
                  <p id="paragraph-85229cd8e48b40c890d86509d2224913"> 200.0</p>
                </td>
                <td id="table-cell-b6d95c0e59ee4d50a2999eb8662a3f88" align="right">
                  <p id="paragraph-ba5edf5b97a843fd8f8ee4254e9daa3e"> 137.0</p>
                </td>
              </tr>
              <tr id="table-row-64a9bc1881054c5abb458fb3b08de435">
                <td id="table-cell-cb282239d5ca425db9b24a27e12a770f" align="left">
                  <p id="paragraph-f0cec19b3b764d3f8d6729ce151190cf"> </p>
                </td>
                <td id="table-cell-185c5275b4074c5eb350746f0341cb6f" align="center">
                  <p id="paragraph-dbaa85b24a0e4e2eaa140ddd4a973b46"> </p>
                </td>
                <td id="table-cell-cbb30039b3454f41bfd8119a8ea4625c" align="right">
                  <p id="paragraph-668dde63399546abb4549bee2ec6a18f"> 1 106.0 (Worst)</p>
                </td>
                <td id="table-cell-b41b45a7b4b74d1b9f9258306acb1b21" align="right">
                  <p id="paragraph-f5e3b560d68743b999553a858e8ef7d3"> 1 105.0</p>
                </td>
                <td id="table-cell-c31fe500f6fc4cb482190f8320085dbe" align="right">
                  <p id="paragraph-9673075c7dae45f09d7c39cc1052d6f1"> 123.0</p>
                </td>
                <td id="table-cell-4c2990c0040343848470cef65fbe3031" align="right">
                  <p id="paragraph-6b19f7fc3ad440a2a8c79f7880e6d232"> 1 106.0</p>
                </td>
                <td id="table-cell-b6762553aa6f4a79a398f7cb29bce9b6" align="right">
                  <p id="paragraph-a3d4c88c0bb047caa3da582c6a65d9a7"> 1 063.5</p>
                </td>
                <td id="table-cell-9f8102a65bcb4f5e8738e3f1a26bb08f" align="right">
                  <p id="paragraph-99fe651399eb44c9827aa32fce13b614"> 690.0</p>
                </td>
              </tr>
              <tr id="table-row-bdaf91c55ca74267b00bceceb593a3a5">
                <td id="table-cell-dbe73270fa9d4039a620d5ee0334909b" align="left">
                  <p id="paragraph-91e54b215c4c4ce18af19ffb5e893a10"> </p>
                </td>
                <td id="table-cell-7a5e160bd04b43ac87a81f7d7675a96a" align="center">
                  <p id="paragraph-7bf1a890809346178866fd22f73e535c"> XSens</p>
                </td>
                <td id="table-cell-f6a832f465964aaba96ae9c3628a65f9" align="right">
                  <p id="paragraph-ef791d0fb5f949a5997d7846569c9394"> 1.0 (Best)</p>
                </td>
                <td id="table-cell-fd7b7402d4c44e3a999fa1caccf46c47" align="right">
                  <p id="paragraph-ac49016bd60b4447a41dde33f86f6fe8"> 1.0</p>
                </td>
                <td id="table-cell-0c3e979d427145d199ab49f8410c35b4" align="right">
                  <p id="paragraph-deaa60c69c6d4ff0be6d33bf3d6bf2d1"> 12.0</p>
                </td>
                <td id="table-cell-6e37d0f4607245b2b5a3fd555aa91605" align="right">
                  <p id="paragraph-20ba2911d9d141e5a06ea21be9bd2737"> 1.0</p>
                </td>
                <td id="table-cell-1c41637a83854c96a49f2dcf52b2d464" align="right">
                  <p id="paragraph-2791052b5a8f4360bcf249cbfc38888f"> 2.0</p>
                </td>
                <td id="table-cell-3c2cbc36688449b4bf9c1f09c3f5ef32" align="right">
                  <p id="paragraph-8aa1af60b8b14ff3bcf5026c47eb5091"> 455.5</p>
                </td>
              </tr>
              <tr id="table-row-d2a2f478dbdf40fbad56341b8b06f5d6">
                <td id="table-cell-a5e09a04da334bb8bb399b4050bf0ccf" align="left">
                  <p id="paragraph-50e014a6af9a4a12893a30440d1941d0"> </p>
                </td>
                <td id="table-cell-bcb666b3f8254c969776fc2ba18c784d" align="center">
                  <p id="paragraph-f0f2bf7658c34a30a491bf3b92800b6e"> </p>
                </td>
                <td id="table-cell-b32624d8d87f4130ac41b00a93211ced" align="right">
                  <p id="paragraph-9781d927462a4c0a84ba9c5d7a254c33"> 1 080.0 (Worst)</p>
                </td>
                <td id="table-cell-01d8dbd67253432099620012ba5356dc" align="right">
                  <p id="paragraph-466030d1922b496e9d4f5dfe9a9105cb"> 1 071.0</p>
                </td>
                <td id="table-cell-684ab4aac8a047ef9315ca72b09d5bd2" align="right">
                  <p id="paragraph-f17e5ea219ff4f19b4caf01224a5e8cf"> 791.0</p>
                </td>
                <td id="table-cell-6837fd641bcd451cac9d2a286f8c7f75" align="right">
                  <p id="paragraph-771bb4118909421b8846655a8ffa0615"> 1 072.0</p>
                </td>
                <td id="table-cell-5cc0251072004c4582dd1dda73d414ac" align="right">
                  <p id="paragraph-eab8a2989ced498a9981430380792aa2"> 1 041.0</p>
                </td>
                <td id="table-cell-7b19ac647fe14563b4c57521adfcb1fa" align="right">
                  <p id="paragraph-721edfe16fc44e0cb140ccd06d71c081"> 1 073.0</p>
                </td>
              </tr>
              <tr id="table-row-d9aefc189e8d4cd8985f074dd93dbd4c">
                <td id="table-cell-43075dc27ec149dd9230c54d0dd292d1" align="left">
                  <p id="paragraph-f751c71d396d49c78fe88480007d1740"> 10</p>
                </td>
                <td id="table-cell-74a4df00fb73401883746616fd413365" align="center">
                  <p id="paragraph-b77514c1e9b348b59802e024ffadf4b6"> Smartphone</p>
                </td>
                <td id="table-cell-c48ec75937524958ad59a256417a1c0a" align="right">
                  <p id="paragraph-6f8292e594394d6392dccdf64a1cef6b"> 1.0 (Best)</p>
                </td>
                <td id="table-cell-91c530dbcf2a4e519ad979e9fa10a344" align="right">
                  <p id="paragraph-4bcb79d8e86748d199b486a9188b0216"> 146.0</p>
                </td>
                <td id="table-cell-847d7d648a5342bba058fc0cec6f039a" align="right">
                  <p id="paragraph-743e5ce4e08b44c591aa1c2c83d0a685"> 19.0</p>
                </td>
                <td id="table-cell-a825480e1e584782ab2d699c7015af0e" align="right">
                  <p id="paragraph-9dbe04ef17df460884b545bf6a32ac13"> 60.0</p>
                </td>
                <td id="table-cell-2dba8d11856c4c3c9ef2cc3464b170b5" align="right">
                  <p id="paragraph-0adf20b84f614f8ba9bac601510d9bb3"> 88.5</p>
                </td>
                <td id="table-cell-75c6175d896a44049b8ec014d2bc22df" align="right">
                  <p id="paragraph-005cf4839d83486ba3280dc182690a10"> 54.5</p>
                </td>
              </tr>
              <tr id="table-row-7ad43a41150a46ef9e1a4992de2bb612">
                <td id="table-cell-61fc613d5654466caa0d69a0b7d73a65" align="left">
                  <p id="paragraph-07705d975329483c91e67d894b87394c"> </p>
                </td>
                <td id="table-cell-cd3a664fbf884aeb868591cfc4e2fb3d" align="center">
                  <p id="paragraph-bb944564152649e4a83ca55ab702512f"> </p>
                </td>
                <td id="table-cell-46b93062a7774bc883623a800e9a74f4" align="right">
                  <p id="paragraph-fd370189090c4f2ab3dfc1a4ebc1858f"> 559.0 (Worst)</p>
                </td>
                <td id="table-cell-882eac3fb7584330ab3d91d7636a9319" align="right">
                  <p id="paragraph-fe70b637c4e94503b6c4e0edbce45264"> 558.0</p>
                </td>
                <td id="table-cell-1f6be6383e2646ab9bcdb19e857ded95" align="right">
                  <p id="paragraph-1fce3b79a6c549e7822e4ba543ffeaf7"> 318.0</p>
                </td>
                <td id="table-cell-22d4357f77144f8b849cb72a346c8d06" align="right">
                  <p id="paragraph-858d73b881bb4f06a030fc8a7bef38bd"> 557.0</p>
                </td>
                <td id="table-cell-e06c0414c4274bc992c06718ab473c5b" align="right">
                  <p id="paragraph-cecde2c7275b4274bf2a3c9de441cff1"> 536.5</p>
                </td>
                <td id="table-cell-4cd0e984ddeb4c52a15a38ef811c21a8" align="right">
                  <p id="paragraph-83f6684356ea4f6daabef6b3773bf5bd"> 334.0</p>
                </td>
              </tr>
              <tr id="table-row-81e09e5d2a374bc2b07bea34bcf22e73">
                <td id="table-cell-1de51227daad4f15828d74b88345a363" align="left">
                  <p id="paragraph-445a151169834eed891b649155d0ca6e"> </p>
                </td>
                <td id="table-cell-fca4e4451f134ae9b250282624a1f253" align="center">
                  <p id="paragraph-e20be5f5b27048c98bb6d0ce61fca01e"> XSens</p>
                </td>
                <td id="table-cell-2ac0376fd98e44fbba27406673333397" align="right">
                  <p id="paragraph-b4f307eee0f34f07954605b117f66596"> 1.0 (Best)</p>
                </td>
                <td id="table-cell-c466de8d3d0c42c6af535fb10e3e4d85" align="right">
                  <p id="paragraph-3ef06966291445268d550c80a98069c6"> 1.0</p>
                </td>
                <td id="table-cell-765b958612ce4cc58257ed1b679f0517" align="right">
                  <p id="paragraph-55d55e6507424ec5b9e21a7548b37752"> 6.0</p>
                </td>
                <td id="table-cell-da8e4db9e95348c9a2a1bd68118947ee" align="right">
                  <p id="paragraph-d5b2692230b046298ba131b94e245b4c"> 1.0</p>
                </td>
                <td id="table-cell-95bd475eb9a9403fb07b73b9d371374e" align="right">
                  <p id="paragraph-74ff1454d7594f6cad04c2fe7cd29773"> 1.0</p>
                </td>
                <td id="table-cell-dfee88b509d94e6ba070e380b8788458" align="right">
                  <p id="paragraph-8f78db8c888e4f30a4f7f18826087b1b"> 244.5</p>
                </td>
              </tr>
              <tr id="table-row-5608f2e8af344354bef2f6700bdcd404">
                <td id="table-cell-2cbbfea7436347c4a213546c1c8e2003" align="left">
                  <p id="paragraph-8cacbd37b3a54c32842a32b3caec70e8"> </p>
                </td>
                <td id="table-cell-46a1532168a84b8da8f4e5e361695350" align="center">
                  <p id="paragraph-41d1cb680f604a32a47d78ebc1d470b0"> </p>
                </td>
                <td id="table-cell-f204371776d84a08950eb250b121ff94" align="right">
                  <p id="paragraph-e6d0903a41f84fff8e81131c12950dad"> 550.0 (Worst)</p>
                </td>
                <td id="table-cell-6e1d75dbea7e4bbb9de6622da41dc8bc" align="right">
                  <p id="paragraph-8e0d883209284164ad1bd767452cb208"> 547.0</p>
                </td>
                <td id="table-cell-98e72fde7ba04a83a9bf650722fdb984" align="right">
                  <p id="paragraph-559d31c32b05422fb9039c173aff1b1c"> 425.0</p>
                </td>
                <td id="table-cell-376247f56a094bf9a36f7118ff0d73cb" align="right">
                  <p id="paragraph-16d3048997964b4c938502cf3bf7e6d1"> 547.0</p>
                </td>
                <td id="table-cell-283319139aaf4371b68da1203d3fddbf" align="right">
                  <p id="paragraph-b5440ad8e13346dc8f63735fc0ff0f73"> 529.0</p>
                </td>
                <td id="table-cell-a8deafc1a3a6436eb5df3f570d696ee8" align="right">
                  <p id="paragraph-b818f6cfbbcb487aac2030f3090d5922"> 547.0</p>
                </td>
              </tr>
              <tr id="table-row-762828b040a741508fa04232aa66f84b">
                <td id="table-cell-6b9d9e36bfc04807b17edf12699d4bdd" align="left">
                  <p id="paragraph-365a9db6a8e84355a3b380fd3bb10665"> 20</p>
                </td>
                <td id="table-cell-c833c7c080d64dc389862a6efffeef83" align="center">
                  <p id="paragraph-f9f4ac1b560442b1a344a0794f863077"> Smartphone</p>
                </td>
                <td id="table-cell-80817e1bdc73420a9f5436baff4912fe" align="right">
                  <p id="paragraph-a62f7a0a3279408faa69e3ae80432800"> 1.0 (Best)</p>
                </td>
                <td id="table-cell-d35a5c2b73e2462bb031b43ed13df955" align="right">
                  <p id="paragraph-639301844d8940be95849ed8051c4b89"> 42.0</p>
                </td>
                <td id="table-cell-d7fc58778ac343fab61914c136aaa237" align="right">
                  <p id="paragraph-5bece7f3e84f46e391276d2cc936f1e6"> 124.0</p>
                </td>
                <td id="table-cell-2a3314479f8944e2a2f7422bcad45edf" align="right">
                  <p id="paragraph-7c849ded999141d0a577573614417a86"> 49.0</p>
                </td>
                <td id="table-cell-b06adc8824af403999c7f6cfe4708781" align="right">
                  <p id="paragraph-3115636c3a034e9bb3395aa71ea8a26e"> 40.5</p>
                </td>
                <td id="table-cell-8d4af90b856f49ea9791ee00d65ccd32" align="right">
                  <p id="paragraph-0dc8408247d1484d981f08fe3c84c8c6"> 21.0</p>
                </td>
              </tr>
              <tr id="table-row-974b9b025dc74f6aba35c9d42f7f357b">
                <td id="table-cell-a587f58030f74c45a5fd10b91a2d428f" align="left">
                  <p id="paragraph-25bf3e0dc92443aa810981866b9777be"> </p>
                </td>
                <td id="table-cell-84c13e87f6d24dceba619cd0621db60e" align="center">
                  <p id="paragraph-780b49a0e48740a08dda00b698619aaa"> </p>
                </td>
                <td id="table-cell-2a7632fcd609451abf2c401e109f58ce" align="right">
                  <p id="paragraph-3abb5d0a0c3c41b3843d295237cb08bc"> 285.0 (Worst)</p>
                </td>
                <td id="table-cell-674f64d261f843babdec0758e2efce65" align="right">
                  <p id="paragraph-22e98a3e222e4f1fb73d75194a0fb6d6"> 283.0</p>
                </td>
                <td id="table-cell-d06e5b0c2d724df690434707baa5bb96" align="right">
                  <p id="paragraph-ffcc1a9e3cd44afc9eb34243a6994891"> 143.0</p>
                </td>
                <td id="table-cell-c3e7aba3080d4218b4e520f0c30426e0" align="right">
                  <p id="paragraph-3112f0ffdd17454f90462fe0e8734d5f"> 283.0</p>
                </td>
                <td id="table-cell-1d7fb167bf09495fa11529ca3095d32f" align="right">
                  <p id="paragraph-c82784c2e5ba4a5b88a61ea7e4f50724"> 274.5</p>
                </td>
                <td id="table-cell-be71ad3222954954b478e1c63e92dd64" align="right">
                  <p id="paragraph-b6ad81d3c8dc4116bc39662ef1542900"> 163.5</p>
                </td>
              </tr>
              <tr id="table-row-804f0661f9074fb58421cb0f1a235ed5">
                <td id="table-cell-fd0007408171477f937ee9060aeef9cb" align="left">
                  <p id="paragraph-fb6f1fb30fa34a069e4188afdb52dcc4"> </p>
                </td>
                <td id="table-cell-4998c5400cc340448403ba93b379ff96" align="center">
                  <p id="paragraph-7902f70955ec4cc5bc79ce7ee7eb425d"> XSens</p>
                </td>
                <td id="table-cell-a27107c96a83476ca99a5d381df2fe0d" align="right">
                  <p id="paragraph-bb54903723ba4ded907d9379088fe2bd"> 1.0 (Best)</p>
                </td>
                <td id="table-cell-4aaa6cb35b064ecca2deb6affc215e68" align="right">
                  <p id="paragraph-c3ba673db01e45f2ae8547cbc063071c"> 1.0</p>
                </td>
                <td id="table-cell-a4305a6c08e741aba0960a8c0a8827db" align="right">
                  <p id="paragraph-cd15c7980ca944b2b6145874be6449bb"> 2.0</p>
                </td>
                <td id="table-cell-6d6cbd36fc354f6d85711e7a352974a4" align="right">
                  <p id="paragraph-1e3c9a9e5f7b4355896fabce601f6bde"> 1.0</p>
                </td>
                <td id="table-cell-4e422f4586aa48e8b8f9b3a342faf12a" align="right">
                  <p id="paragraph-6941658017894281b8880554853af006"> 1.0</p>
                </td>
                <td id="table-cell-7fbf9fa0f8b14d0db262ecb22f8b6923" align="right">
                  <p id="paragraph-489337930abc4f3e879cb14ab507d9e5"> 129.5</p>
                </td>
              </tr>
              <tr id="table-row-6e78bc8ec2a94eeea75bb48e5a08288f">
                <td id="table-cell-68ceb36253494e79bf86cdf73102e09b" align="left">
                  <p id="paragraph-89ebb7d5479e4b4b8f401c6b00aa155b"> </p>
                </td>
                <td id="table-cell-e4ae1a16178a418c84acf601abbfebe1" align="center">
                  <p id="paragraph-13ee13fda44945c9899e6ef1c4f02156"> </p>
                </td>
                <td id="table-cell-842bdc13c46a4d489339f2d7205369be" align="right">
                  <p id="paragraph-c98853f132d246dabca127a96707d9cf"> 280.0 (Worst)</p>
                </td>
                <td id="table-cell-0131157d4ec641039fcd5a84aba6cc2c" align="right">
                  <p id="paragraph-87061ede62314eed91ccba936e3d61f9"> 265.0</p>
                </td>
                <td id="table-cell-b0185f7570c24bda9791329dfea52933" align="right">
                  <p id="paragraph-36f30a27b72540a084e5359034808550"> 50.0</p>
                </td>
                <td id="table-cell-e5101d5efb8e4d769a5642176ec128e2" align="right">
                  <p id="paragraph-af2b5af7f20848c4a3776b7b32c2a7d1"> 263.0</p>
                </td>
                <td id="table-cell-3c2b0a2ceafc43909b375d60c2ea6f38" align="right">
                  <p id="paragraph-f9e643db2a7144fc9c36e6443f73caf7"> 270.0</p>
                </td>
                <td id="table-cell-57df30538e2b44e6ac6870be74405ff1" align="right">
                  <p id="paragraph-4ce051d17e3b44acb63d3b9ee49032f9"> 279.5</p>
                </td>
              </tr>
              <tr id="table-row-fbd06ca17a024f4fbe115fa077580db8">
                <td id="table-cell-6e0a2b573fc041c9a856377052faf4fa" align="left">
                  <p id="paragraph-ed8ff150b0594491aa11e624e52e2fa1"> 100</p>
                </td>
                <td id="table-cell-c721008279c34a02a2ade15246dccacd" align="center">
                  <p id="paragraph-e724f0fa1caa4c3291141f631e9689fa"> Smartphone</p>
                </td>
                <td id="table-cell-78d79d7028a844ef975a9287c8af6ae3" align="right">
                  <p id="paragraph-13d8d635eed04ebeb5f7d1d60d72bc95"> 1.0 (Best)</p>
                </td>
                <td id="table-cell-3c84fb3a52bd48d6b4f8c828bf7843e1" align="right">
                  <p id="paragraph-8d62527f9eda40e088821f0b682da84a"> 13.0</p>
                </td>
                <td id="table-cell-b4fa9a7de3e04ddd9bdce77b51969b0e" align="right">
                  <p id="paragraph-1cff36433c3f49c9a352d660189806ab"> 49.0</p>
                </td>
                <td id="table-cell-439fece73b0946abacd77c72c3cdd19d" align="right">
                  <p id="paragraph-be730956bd2d4285961da777a3354fd3"> 21.0</p>
                </td>
                <td id="table-cell-dd0805afc688472b8579f50dec0d25bf" align="right">
                  <p id="paragraph-96b0119fd7534bb2aef729b7d3ed768e"> 23.5</p>
                </td>
                <td id="table-cell-3da2f0875592483fa7c8daa4954c15f1" align="right">
                  <p id="paragraph-08fcd9f9a8074b5fa2cd7196216ef740"> 1.5</p>
                </td>
              </tr>
              <tr id="table-row-c76073ad06f940448ddfcaf9b43ae747">
                <td id="table-cell-f0f6ed1163c1406e96fb9c2bc80fb403" align="left">
                  <p id="paragraph-b41ca673525147459bbd8d41ba15b934"> </p>
                </td>
                <td id="table-cell-3caa9c80697841d287754c25cc560e9a" align="center">
                  <p id="paragraph-f4a711dad4c749379de6454e6eda3085"> </p>
                </td>
                <td id="table-cell-d776c03851e240e3a24f5ee9c314cbf2" align="right">
                  <p id="paragraph-cb54c40666ea413c8e074633135c9b9b"> 57.0 (Worst)</p>
                </td>
                <td id="table-cell-ab85fb003db141c0876491dd8dc228fe" align="right">
                  <p id="paragraph-7c49e9fb37124363825a65cd2f17dbea"> 56.0</p>
                </td>
                <td id="table-cell-e9a932001444454398ac25bb988f78ec" align="right">
                  <p id="paragraph-bf544b7637d74c0bba49c87f86fb25a9"> 23.0</p>
                </td>
                <td id="table-cell-9d2ae497276b4b24863641513758fb89" align="right">
                  <p id="paragraph-905235b70dae40d1b9d6dcb1798b8c7e"> 56.0</p>
                </td>
                <td id="table-cell-8e2818d2413d4f3b877d8a6bd1fb0f2a" align="right">
                  <p id="paragraph-b4bf8eaffecf4b48a8b167cbac989936"> 56.0</p>
                </td>
                <td id="table-cell-eb3c657ffc2e49b58fb779576e12e188" align="right">
                  <p id="paragraph-440b3466868141ebaad97da4a4fc1e14"> 30.0</p>
                </td>
              </tr>
              <tr id="table-row-28a7e08c3285485caa478f6f817e8a06">
                <td id="table-cell-aebec7dbc66946fd8b48eae7bd3eb0ea" align="left">
                  <p id="paragraph-3f06f7d2c80f472580d8322756095a37"> </p>
                </td>
                <td id="table-cell-d5be9933c99542c89705132208a43b8e" align="center">
                  <p id="paragraph-7181e75f5dc54cd2b2286be56cbcbe2b"> XSens</p>
                </td>
                <td id="table-cell-2ca831010a62487cb8dd34c1a46ef848" align="right">
                  <p id="paragraph-0b1870e3dd2247fbbc63a346869e6a86"> 1.0 (Best)</p>
                </td>
                <td id="table-cell-45b64719062d43e0b19f07ce33a0642e" align="right">
                  <p id="paragraph-7f3c6617185d4e3184b233e1968eb05e"> 5.0</p>
                </td>
                <td id="table-cell-18e1e5d86ed847328a802351afdd9aee" align="right">
                  <p id="paragraph-325dfd1f0ed7479189b53e323b12192c"> 9.0</p>
                </td>
                <td id="table-cell-52d799a78b8d48e386414cce7df2e4af" align="right">
                  <p id="paragraph-3b08b80b402c4ccb9b39e4180cdefd4f"> 4.0</p>
                </td>
                <td id="table-cell-5f38f0ce43c943108317e1164c562879" align="right">
                  <p id="paragraph-fbc69e0a33e84e80b96e40e1a6169c28"> 7.0</p>
                </td>
                <td id="table-cell-3b872332e4994809a1a00270270b062f" align="right">
                  <p id="paragraph-a0b31f394e7e4ce6b60ff1c52fd7dd0c"> 28.0</p>
                </td>
              </tr>
              <tr id="table-row-3348591a07964415a24b572801d79ddf">
                <td id="table-cell-1f16ee1eeec74a98a68e47450cecfb9b" align="left">
                  <p id="paragraph-a52453ac36754b4c995bc1feb779470b"> </p>
                </td>
                <td id="table-cell-e317be14d543415e8e38499c751514f4" align="left">
                  <p id="paragraph-975b5c193c6f443b96e210abd132220b"> </p>
                </td>
                <td id="table-cell-c87a0b9c38d04206a1fb1243fe5a74b9" align="right">
                  <p id="paragraph-4bf08b7d5bbe462ea3e80fc2bd4528e8"> 57.0 (Worst)</p>
                </td>
                <td id="table-cell-c616792b0cac4bdea4d0c82c4731d021" align="right">
                  <p id="paragraph-3c932c58cd55443dbc8cf1d0c121f282"> 57.0</p>
                </td>
                <td id="table-cell-6899b4976bba49949cea2219b7379cc1" align="right">
                  <p id="paragraph-656c282471224e3bb1a2da13e9e86863"> 19.0</p>
                </td>
                <td id="table-cell-68202059cd84416e8c51feebaffd6030" align="right">
                  <p id="paragraph-0e4ce4266f6549228d02b35ebbfb5935"> 56.0</p>
                </td>
                <td id="table-cell-491b10e3a2864f1c86702c163a7e4c19" align="right">
                  <p id="paragraph-199b8372d6c447a1b6d55c0f1be038b4"> 56.0</p>
                </td>
                <td id="table-cell-6779e33981194b40839be7972e5295f0" align="right">
                  <p id="paragraph-cfbd49e693964f62ad973a95ebd902d1"> 56.5</p>
                </td>
              </tr>
            </tbody>
          </table>
        </table-wrap>
        <p id="paragraph-a852a962db1a47b4b59d6cad7ab1edaa">For accelerations measured using a smartphone, there is less of an agreement, especially for the good segments. Interestingly, for the good sections, the method by Kranzinger and Leitinger appears to yield better results, however this might also be a misinterpreted effect of the class distribution of the two ordinal methods. Furthermore, the best segments all feature asphalt in good condition, while the worst sections all feature rough cobblestone surfaces. This confirms the general assessments of the used methods. However, 16 out of about 4 000 segments can at best be used to get a general idea of the characteristics of each method. For proper analysis of the methods, quantitative approaches as described are required.</p>
      </sec>
      <sec>
        <title id="title-b765dd6014d14cec88399db6aa7359f6">Discussion</title>
        <p id="paragraph-ee7f3a8bce1f42cea967d55e1c4113e1">The described reliability, stability, and exemplary qualitative results allow for some deductions. First and foremost, the selected method has the biggest impact on the resulting assessment, regarding both reliability and stability, well above the impacts of used sensor and segment length. This supports the ongoing research into better assessment algorithms. Furthermore, it sets the focus for any attempts at assessing infrastructure surface quality in a given area.</p>
        <p id="paragraph-75667e120e4846c288e00c837f0cb284">Second, segment length had a bigger impact on reliability than on stability. This indicates that using several measuring runs is a well suited approach to stabilize the outputs of the considered methods. As a side effect, the impact of the different sensors was less visible on reliability than on stability. However, it still mostly depended on the used method. Overall, except for the IRI, longer segments yield more similar assessments both per test run and also when comparing methods. On the other hand, they also decrease spatial resolution and might impede the detection of small problematic areas by a ‘smoothing’ effect.</p>
        <p id="paragraph-97d92aac386f4c8286972730181f474c">Furthermore, the effect of different sensors, although dependent on segment length, can be considered to have a minor impact on the resulting assessment. Especially for the three root mean square based methods, results using either sensor were very comparable. For the IRI, the generally bad results are worse when based on smartphone measurements. Regarding the method by Kranzinger and Leitinger, it is interesting to note that there is little correlation when applying the method to the acceleration data collected using either sensor. However, as noted it is the only method with parameters tuned to each sensors measurements.</p>
        <p id="paragraph-4d605d8c136c416a937ed2f0e5d90dbe">Last, the coherent results of both the stability and reliability measures indicate the chosen comparison methods to be well suited for the task at hand. This supports the assumptions leading to their selection and might be a useful finding for any future works aiming to compare their own, newly developed, methods to existing ones. As the described methods are mathematically well defined and straightforward to implement, little effort is needed to apply them in any such future works.</p>
      </sec>
    </sec>
    <sec>
      <title id="t-c0fb147c055a">Conclusions </title>
      <p id="paragraph-b5a352879a834bcaa0ac3a17d26a9bde">The aim of this work, as stated in the introduction, is to compare the assessments of the selected surface roughness calculation methods and sensors in order to provide recommendations for such assessments. To this end, acceleration measurements were conducted and the collected measurements assigned to different length segments. Based on these, roughness metrics were calculated using reimplementations of the selected metrics. These were then compared using specifically developed comparison methods. The results concerning stability, reliability, and results on selected segments were presented in the previous section. These yield the following recommendations for the usage of certain sensors and methods for surface roughness assessment.</p>
      <p id="paragraph-d39e34d12c864526b9780a3153a1f6c3">First and foremost, the DCI developed by <xref id="xref-3886a0847eb04bbb908a68253d3936cc" rid="R253322532192768" ref-type="bibr">Bíl et al. (2015)</xref> yielded the most stable results across both sensors and all segment lengths, but especially for shorter segments. It was among the least affected by sensor selection and also among the highest correlated with all other methods. Furthermore, the method description in the original work is commendably concise and the used algorithm is straightforward and easy to implement. Therefore, the DCI emerges from this comparison as the recommended approach to assess surface roughness.</p>
      <p id="paragraph-0c04155bb7ce4da6bf2618401f52c370">Next, the method to calculate the IRI as described by <xref id="xref-4d99397b7df24bb394b066589bc52d24" rid="R253322532192745" ref-type="bibr">Zang et al. (2018)</xref> yielded very unstable results, which were also not correlated well with the other methods. Therefore, this method should not be used for roughness calculation unless two prerequisites are met: First, either through communication with the original authors or by extensive own exploration, a stable, reliable implementation of the method is found. To this end, the exact implementation of the double integration as well as possible filtering steps would need to be examined carefully. And second, high quality measuring equipment firmly attached to the bike yielding as little noise as possible needs to be used to alleviate the problem of high outlier sensibility. As these are two considerable obstacles, the method cannot be recommended despite its solid foundation in road maintenance and inherent elegance.</p>
      <p id="paragraph-a6aeb1d0e7db4d75b4d95d3b8bda232e">If an ordinal assessment is required, the BEQI  (<xref id="xref-46577e46611a4459b34cc410e54813f6" rid="R253322532192765" ref-type="bibr">Nuñez et al., 2020</xref>) is both more stable and better correlated to the other methods than the method by <xref id="xref-2ed6fe5175aa4a0486ff7a42f662fdf9" rid="R253322532192775" ref-type="bibr">Kranzinger and Leitinger (2021)</xref>. However, as the BEQI only splits the root mean square of vertical accelerations at certain thresholds another viable option emerges: First, calculate the DCI as described by <xref id="xref-884a4644aa2e4696b07371e8a5af35a0" rid="R253322532192768" ref-type="bibr">Bíl et al. (2015)</xref>, then select fitting thresholds for an ordinal assessment based on the results.</p>
      <p id="paragraph-a51e779295f44e04a0d66c976f27942f">The DCC as described by <xref id="xref-155ef79e764a491e8b8e9c62cb364158" rid="R253322532192751" ref-type="bibr">Gao et al. (2018)</xref> is neither particularly stable or unstable nor reliable or unreliable. It is therefore not recommended before or against, but considered a solid variant of ISO 2631-1  (<xref id="xref-c28663f5485b40c7af08d0f327f3995b" rid="R253322532192776" ref-type="bibr">ISO, 1997</xref>). This ISO standard however should at least be considered by any attempt at developing a surface roughness assessment method based on acceleration measurements.</p>
      <p id="paragraph-a1df3ea53d524aa9ae5b922b775cb8c2">Choosing the right segment length is another important consideration. As expected, the stability generally increased with longer segments. So did the correlation between methods, indicating a high reliability of assessments on longer segments. On the other hand, spatial resolution is lost and short problematic sections might be missed because of the ‘smoothing’ effect of longer segments. With the differences in correlation between the segment lengths in mind, 5 m and possibly shorter segments should only be chosen where absolutely required as they yield both unstable and somewhat unreliable results. Segments of 10 m to 20 m are both reasonable, so selection should be made based on the aforementioned criteria. Longer segments, especially 100 m and beyond yield limited gains in reliability and stability, and should only be used if the assessments need to be based on acceleration data of questionable reliability for some reason.</p>
      <p id="paragraph-02627e793ee1413b9d2fd97b52194328">The impact of the different sensors on both stability and reliability is neglectable compared to both method and segment length. Therefore, a handlebar mounted smartphone can be concluded to be sufficient for bicycle infrastructure assessment. This furthermore confirms the applicability of smartphone-based crowd sourcing approaches for large scale infrastructure assessment.</p>
      <p id="paragraph-421771e8ad7d4c9f88f0b89bb3fad844">Using acceleration measurements for roughness estimation is a well established and wide spread approach. Still, it is not without limitations that should be considered, especially when attempting large scale infrastructure assessment. These limitations, as described in detail in different sections throughout this document, are as follows: First and foremost, each test ride only captures the roughness directly beneath the tires. Therefore, any reliable assessment needs to be based on multiple rides on the same routes. Next, the measured accelerations are highly dependent on the type of bicycle used, the weight of the rider, the tire pressure, and the riding style, especially the speed. Thus, data from different sources is not directly comparable. This might be mitigated by sufficiently large numbers of measurements to work with averages instead of single test rides. Finally, using IMUs for roughness estimation does make distinguishing different reasons for roughness difficult. Cobblestone is barely different from unintentionally rough surfaces, manhole covers are hard to distinguish from potholes. If the reasons for roughness are to be considered, video based approaches should be considered.</p>
      <p id="paragraph-0ad03642bd2a412787e1fb2b5d8386a6">The comparison approaches developed for this work proved usable and conclusive. One caveat is the effect of multiple similar methods in the comparison, increasing the apparent reliability of these works. However, this can be alleviated by careful selection of comparison methods. At the very least, a feasible way of comparing the reliability of ordinal and numerical assessment methods was presented. Unfortunately, the stability can only be compared among either ordinal or numerical assessments. Nonetheless, the presented approach constitutes a solid tool to select fitting segment lengths for surface roughness assessment methods. </p>
      <p id="p-5f5458a10fc7">Based on the presented research, several possible future research topics arise. The most obvious and easily achievable one would be to apply the comparison to additional methods from literature. However, the authors believe to have chosen reasonable representatives for the most common approaches, and therefore expect little additional value from doing so. More interestingly, based on the consideration that a handlebar-mounted smartphone is sufficient for surface roughness assessment, the effect of using different bikes could be properly quantified. Subsequently, suitable normalization methods could be developed to further advance crowd sourcing approaches. Lastly, the developed comparison methods could be used to develop yet more stable and reliable assessment approaches for high precision roughness mapping.</p>
    </sec>
    <sec>
      <title id="t-4cac09213327">Funding</title>
      <p id="p-8b7a594bda18">The research leading to these results has received funding from the Federal Ministry for Climate Action, Environment, Energy, Mobility, Innovation and Technology (BMK) under grant GZ 2021-0.641.557.</p>
    </sec>
    <sec>
      <title id="t-7401de37fddd">Declaration of competing interests</title>
      <p id="p-9f57989389de">The authors declare no conflict of interests.</p>
    </sec>
  </body>
  <back>
    <ack>
      <title id="title-f44222ea9b02495296388789680fea33">Acknowledgments</title>
      <p id="p-13a2f7e953f5">An early version of this paper was presented as a poster at the 11<sup id="superscript-2">th</sup>  International Cycling Safety Conference (ICSC), held in The Hague, the Netherlands, on 15–17 November 2023.</p>
      <p id="p-d0b3d0725107">The original poster can be accessed at https://www.salzburgresearch.at/publikation/how-smooth-is-your-ride.</p>
    </ack>
    <fn-group id="fg-b169b262455f">
      <fn id="f-f2fdcae63d3a">
        <p id="p-15676f5f4e04"><sup id="s-a92d73902984"/>https://www.movella.com/products/sensor-modules/xsens-mti-680g-rtk-gnss-ins</p>
      </fn>
    </fn-group>
    <ref-list>
      <title>References</title>
      <ref id="R253322532192765">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Nuñez</surname>
              <given-names>Javier Yesid Mahecha</given-names>
            </name>
            <name>
              <surname>Bisconsini</surname>
              <given-names>Danilo Rinaldi</given-names>
            </name>
            <name>
              <surname>da Silva</surname>
              <given-names>Antônio Nélson Rodrigues</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Combining environmental quality assessment of bicycle infrastructures with vertical acceleration measurements</article-title>
          <source>Transportation Research Part A: Policy and Practice</source>
          <year>2020</year>
          <volume>137</volume>
          <fpage>447</fpage>
          <lpage>458</lpage>
          <pub-id pub-id-type="doi">10.1016/j.tra.2018.10.032</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192751">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gao</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Sha</surname>
              <given-names>Aimin</given-names>
            </name>
            <name>
              <surname>Huang</surname>
              <given-names>Yue</given-names>
            </name>
            <name>
              <surname>Hu</surname>
              <given-names>Liqun</given-names>
            </name>
            <name>
              <surname>Tong</surname>
              <given-names>Zheng</given-names>
            </name>
            <name>
              <surname>Jiang</surname>
              <given-names>Wei</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Evaluating the Cycling Comfort on Urban Roads Based on Cyclists’ Perception of Vibration</article-title>
          <source>Journal of Cleaner Production</source>
          <year>2018</year>
          <volume>192</volume>
          <pub-id pub-id-type="doi">10.1016/j.jclepro.2018.04.275</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192745">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Zang</surname>
              <given-names>Kaiyue</given-names>
            </name>
            <name>
              <surname>Shen</surname>
              <given-names>Jie</given-names>
            </name>
            <name>
              <surname>Huang</surname>
              <given-names>Haosheng</given-names>
            </name>
            <name>
              <surname>Wan</surname>
              <given-names>Mi</given-names>
            </name>
            <name>
              <surname>Shi</surname>
              <given-names>Jiafeng</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Assessing and Mapping of Road Surface Roughness Based on GPS and Accelerometer Sensors on Bicycle-Mounted Smartphones</article-title>
          <source>Sensors</source>
          <year>2018</year>
          <volume>18</volume>
          <issue>3</issue>
          <pub-id pub-id-type="doi">10.3390/s18030914</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192768">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Bíl</surname>
              <given-names>Michal</given-names>
            </name>
            <name>
              <surname>Andrášik</surname>
              <given-names>Richard</given-names>
            </name>
            <name>
              <surname>Kubeček</surname>
              <given-names>Jan</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>How Comfortable Are Your Cycling Tracks? A New Method for Objective Bicycle Vibration Measurement</article-title>
          <source>Transportation Research Part C: Emerging Technologies</source>
          <year>2015</year>
          <volume>56</volume>
          <pub-id pub-id-type="doi">10.1016/j.trc.2015.05.007</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192775">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Kranzinger</surname>
              <given-names>Stefan</given-names>
            </name>
            <name>
              <surname>Leitinger</surname>
              <given-names>Sven</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Eine Bestimmung der Oberflächenqualität von Fahrradinfrastruktur durch Smartphone-Beschleunigungsdaten mithilfe des k-means++-Algorithmus [A Determination of the Surface Quality of Bicycle Infrastructure from Smartphone Acceleration Data Using the k-means]</article-title>
          <source> Journal für Angewandte Geoinformatik</source>
          <year>2021</year>
          <volume>7</volume>
          <pub-id pub-id-type="doi">10.14627/537707013</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192771">
        <element-citation publication-type="website">
          <person-group person-group-type="author">
            <name>
              <surname>UN</surname>
              <given-names/>
            </name>
            <collab/>
          </person-group>
          <article-title>Cycling and Sustainable Development Goals</article-title>
          <source>United Nations Regional Information Centre for Western Europe</source>
          <year>2024</year>
          <uri>https://unric.org/en/sustainable-development-goals-cycling/</uri>
          <date-in-citation content-type="access-date">2024-11-20</date-in-citation>
        </element-citation>
      </ref>
      <ref id="R253322532192747">
        <element-citation publication-type="research-report">
          <person-group person-group-type="author">
            <name>
              <surname>EC</surname>
              <given-names/>
            </name>
            <collab/>
          </person-group>
          <article-title>Communication from the Commission to the European Parliament, the Council, the European Economic and Social Committee and the Committee of the Regions. Proposing a European Declaration on Cycling</article-title>
          <year>2023</year>
          <uri>https://eur-lex.europa.eu/legal-content/EN/TXT/?uri=CELEX:52023DC0566</uri>
          <comment>Brussels, 4.10.2023  COM(2023) 566 final</comment>
        </element-citation>
      </ref>
      <ref id="R253322532192770">
        <element-citation publication-type="research-report">
          <person-group person-group-type="author">
            <name>
              <surname>Holger</surname>
              <given-names>Heinfellner</given-names>
            </name>
            <name>
              <surname>Ibesich</surname>
              <given-names>Nikolaus</given-names>
            </name>
            <name>
              <surname>Kurzweil</surname>
              <given-names>Agnes</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Cycling Master Plan 2015 - 2025</article-title>
          <source>Federal Ministry of Agriculture, Forestry, Environment and Water Management</source>
          <year>2015</year>
          <uri>https://eionet.kormany.hu/download/7/a4/a1000/43_MP-Radfahren_englisch_web.pdf</uri>
        </element-citation>
      </ref>
      <ref id="R253322532192763">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Ayachi</surname>
              <given-names>Fouaz</given-names>
            </name>
            <name>
              <surname>Dorey</surname>
              <given-names>Jonathan</given-names>
            </name>
            <name>
              <surname>Guastavino</surname>
              <given-names>Catherine</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Identifying Factors of Bicycle Comfort: An Online Survey with Enthusiast Cyclists</article-title>
          <source>Applied Ergonomics</source>
          <year>2014</year>
          <volume>46</volume>
          <fpage>124</fpage>
          <pub-id pub-id-type="doi">10.1016/j.apergo.2014.07.010</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192746">
        <element-citation publication-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>CROW</surname>
              <given-names/>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <article-title>Design Manual for Bicycle Traffic</article-title>
          <volume>Record 28</volume>
          <publisher-name>CROW-fietsberaad</publisher-name>
          <publisher-loc>Rotterdam, the Netherlands</publisher-loc>
          <year>2016</year>
        </element-citation>
      </ref>
      <ref id="R253322532192752">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Landis</surname>
              <given-names>Bruce W</given-names>
            </name>
            <name>
              <surname>Vattikuti</surname>
              <given-names>Venkat R.</given-names>
            </name>
            <name>
              <surname>Brannick</surname>
              <given-names>Michael T.</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Real-Time Human Perceptions: Toward a Bicycle Level of Service</article-title>
          <source>Transportation Research Record</source>
          <year>1997</year>
          <volume>1578</volume>
          <issue>1</issue>
          <fpage>119</fpage>
          <lpage>126</lpage>
          <pub-id pub-id-type="doi">10.3141/1578-15</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192755">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Hoelzel</surname>
              <given-names>Christin</given-names>
            </name>
            <name>
              <surname>Höchtl</surname>
              <given-names>Franz</given-names>
            </name>
            <name>
              <surname>Senner</surname>
              <given-names>Veit</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Cycling Comfort on Different Road Surfaces</article-title>
          <source>Procedia Engineering</source>
          <year>2012</year>
          <volume>34</volume>
          <pub-id pub-id-type="doi">10.1016/j.proeng.2012.04.082</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192744">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Castañon</surname>
              <given-names>Ugo N</given-names>
            </name>
            <name>
              <surname>Ribeiro</surname>
              <given-names>Paulo J G</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Bikeability and Emerging Phenomena in Cycling: Exploratory Analysis and Review</article-title>
          <source>Sustainability</source>
          <year>2021</year>
          <volume>13</volume>
          <issue>4</issue>
          <pub-id pub-id-type="doi">10.3390/su13042394</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192760">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gadsby</surname>
              <given-names>April</given-names>
            </name>
            <name>
              <surname>Tsai</surname>
              <given-names>James</given-names>
            </name>
            <name>
              <surname>Watkins</surname>
              <given-names>Kari</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Understanding the Influence of Pavement Conditions on Cyclists’ Perception of Safety and Comfort Using Surveys and Eye Tracking</article-title>
          <source>Transportation Research Record: Journal of the Transportation Research Board</source>
          <year>2022</year>
          <volume>2676</volume>
          <issue>12</issue>
          <pub-id pub-id-type="doi">10.1177/03611981221090936</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192774">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Astarita</surname>
              <given-names>Vittorio</given-names>
            </name>
            <name>
              <surname>Rosolino</surname>
              <given-names>Vaiana</given-names>
            </name>
            <name>
              <surname>Teresa</surname>
              <given-names>Iuele</given-names>
            </name>
            <name>
              <surname>Vincenzo</surname>
              <given-names>Giofrè</given-names>
            </name>
            <name>
              <surname>Francesco</surname>
              <given-names>De Masi</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Automated Sensing System for Monitoring of Road Surface Quality by Mobile Devices</article-title>
          <source> Procedia - Social and Behavioral Sciences</source>
          <year>2014</year>
          <volume>111</volume>
          <pub-id pub-id-type="doi"> 10.1016/j.sbspro.2014.01.057</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192764">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Litzenberger</surname>
              <given-names>Stefan</given-names>
            </name>
            <name>
              <surname>Christensen</surname>
              <given-names>T</given-names>
            </name>
            <name>
              <surname>Hofstätter</surname>
              <given-names>O</given-names>
            </name>
            <name>
              <surname>Sabo</surname>
              <given-names>A</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Prediction of Road Surface Quality during Cycling Using Smartphone Accelerometer Data</article-title>
          <source>Proceedings</source>
          <year>2018</year>
          <volume>2</volume>
          <issue>6</issue>
          <fpage>217</fpage>
          <pub-id pub-id-type="doi">10.3390/proceedings2060217</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192776">
        <element-citation publication-type="research-report">
          <person-group person-group-type="author">
            <name>
              <surname>ISO</surname>
              <given-names/>
            </name>
            <collab/>
          </person-group>
          <article-title>Mechanical vibration and shock — Evaluation of human exposure to whole-body vibration</article-title>
          <source>International Standards Organisation</source>
          <year>1997</year>
          <fpage>2631</fpage>
          <lpage>2632</lpage>
          <uri>https://www.iso.org/standard/7612.html</uri>
          <comment>ISO 2631-1:1997</comment>
        </element-citation>
      </ref>
      <ref id="R253322532192761">
        <element-citation publication-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Sayers</surname>
              <given-names>Michael W</given-names>
            </name>
            <name>
              <surname>Karamihas</surname>
              <given-names>M Steven</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <source>The Little Book of Profiling</source>
          <publisher-name>University of Michigan</publisher-name>
          <publisher-loc>USA</publisher-loc>
          <year>1998</year>
          <uri>https://hdl.handle.net/2027.42/21605</uri>
        </element-citation>
      </ref>
      <ref id="R253322532192772">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Gadsby</surname>
              <given-names>April</given-names>
            </name>
            <name>
              <surname>Watkins</surname>
              <given-names>Kari</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Instrumented Bikes and Their Use in Studies on Transportation Behaviour, Safety, and Maintenance</article-title>
          <source>Transport Reviews</source>
          <year>2020</year>
          <volume>40</volume>
          <issue>6</issue>
          <pub-id pub-id-type="doi">10.1080/01441647.2020.1769227</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192756">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Yamanaka</surname>
              <given-names>Hideo</given-names>
            </name>
            <name>
              <surname>Namerikawa</surname>
              <given-names>Susumu</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Measuring Level-of-Service for Cycling of Urban Streets Using ‘Probe Bicycle System’</article-title>
          <source>Journal of the Eastern Asia Society for Transportation Studies</source>
          <year>2007</year>
          <volume>7</volume>
          <pub-id pub-id-type="doi">10.11175/EASTS.7.1614</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192769">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Yamanaka</surname>
              <given-names>Hideo</given-names>
            </name>
            <name>
              <surname>Xiaodong</surname>
              <given-names>Pan</given-names>
            </name>
            <name>
              <surname>Sanada</surname>
              <given-names>Junko</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Evaluation Models for Cyclists’ Perception Using Probe Bicycle System</article-title>
          <source>Journal of the Eastern Asia Society for Transportation Studies</source>
          <year>2013</year>
          <volume>10</volume>
          <pub-id pub-id-type="doi">10.11175/EASTS.10.1413</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192748">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Axhausen</surname>
              <given-names>W Kay</given-names>
            </name>
            <name>
              <surname>Smith</surname>
              <given-names>R L</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Bicyclist Link Evaluation: A Stated-Preference Approach</article-title>
          <source>Transportation Research Record</source>
          <year>1986</year>
          <volume>1085</volume>
          <uri>http://onlinepubs.trb.org/Onlinepubs/trr/1986/1085/1085-002.pdf</uri>
        </element-citation>
      </ref>
      <ref id="R253322532192773">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Olieman</surname>
              <given-names>Mark</given-names>
            </name>
            <name>
              <surname>Marin-Perianu</surname>
              <given-names>Raluca</given-names>
            </name>
            <name>
              <surname>Marin-Perianu</surname>
              <given-names>Mihai</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Measurement of Dynamic Comfort in Cycling Using Wireless Acceleration Sensors</article-title>
          <source>Procedia Engineering</source>
          <year>2012</year>
          <volume>34</volume>
          <pub-id pub-id-type="doi">10.1016/j.proeng.2012.04.097</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192742">
        <element-citation publication-type="inproceedings">
          <person-group person-group-type="author">
            <name>
              <surname>Neto</surname>
              <given-names>V A Gerson</given-names>
            </name>
            <name>
              <surname>Viana</surname>
              <given-names>Johnattan D. F.</given-names>
            </name>
            <name>
              <surname>Braga</surname>
              <given-names>Reinaldo B.</given-names>
            </name>
            <name>
              <surname>Oliveira</surname>
              <given-names>Carina T.</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <article-title>Surfaces Categorization Based on Data Collected by Bike Sensors</article-title>
          <source>Euro American Conference on Telematics and Information Systems</source>
          <year>2018</year>
          <conf-loc>Fortaleza, Brazil</conf-loc>
          <pub-id pub-id-type="doi">10.1145/3293614.3293625</pub-id>
          <conf-date>12–15 November 2018</conf-date>
        </element-citation>
      </ref>
      <ref id="R253322532192749">
        <element-citation publication-type="inproceedings">
          <person-group person-group-type="author">
            <name>
              <surname>Wijerathne</surname>
              <given-names>N.</given-names>
            </name>
            <name>
              <surname>Viswanath</surname>
              <given-names>S. K.</given-names>
            </name>
            <name>
              <surname>Hasala</surname>
              <given-names>M. S.</given-names>
            </name>
            <name>
              <surname>Beltran</surname>
              <given-names>V.</given-names>
            </name>
            <name>
              <surname>Yuen</surname>
              <given-names>C.</given-names>
            </name>
            <name>
              <surname>Lim</surname>
              <given-names>H. B.</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <article-title>Towards Comfortable Cycling: A Practical Approach to Monitor the Conditions in Cycling Paths</article-title>
          <source> IEEE World Forum on Internet of Things</source>
          <year>2018</year>
          <conf-loc>Singapore</conf-loc>
          <pub-id pub-id-type="doi">10.1109/WF-IoT.2018.8355173</pub-id>
          <conf-date>5–8 February 2018</conf-date>
        </element-citation>
      </ref>
      <ref id="R253322532192757">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Harikrishnan</surname>
              <given-names>P M</given-names>
            </name>
            <name>
              <surname>Gopi</surname>
              <given-names>Varun</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Vehicle Vibration Signal Processing for Road Surface Monitoring</article-title>
          <source>IEEE Sensors Journal</source>
          <year>2017</year>
          <volume>17</volume>
          <issue>16</issue>
          <pub-id pub-id-type="doi">10.1109/JSEN.2017.2719865</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192758">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Calvey</surname>
              <given-names>J</given-names>
            </name>
            <name>
              <surname>Shackleton</surname>
              <given-names>J P</given-names>
            </name>
            <name>
              <surname>Taylor</surname>
              <given-names>M D</given-names>
            </name>
            <name>
              <surname>Llewellyn</surname>
              <given-names>R.</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Engineering Condition Assessment of Cycling Infrastructure: Cyclists’ Perceptions of Satisfaction and Comfort</article-title>
          <source>Transportation Research Part A: Policy and Practice</source>
          <year>2015</year>
          <volume>78</volume>
          <pub-id pub-id-type="doi">10.1016/j.tra.2015.04.031</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532192767">
        <element-citation publication-type="inproceedings">
          <person-group person-group-type="author">
            <name>
              <surname>Taylor</surname>
              <given-names>M.</given-names>
            </name>
            <name>
              <surname>Fairfield</surname>
              <given-names>C. A.</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <article-title>Intelli-Bike: A Cycling Infrastructure Asset Management System</article-title>
          <source>Proceedings of the Bicycle and Motorcycle Dynamics 2010. Symposium on Dynamics and Control of Single Track Vehicles</source>
          <year>2010</year>
          <institution>Delft University of Technology</institution>
          <conf-loc>Delft, The Netherlands</conf-loc>
          <conf-date>20–22 October 2010</conf-date>
          <uri>http://bicycle.tudelft.nl/bmd2010/</uri>
        </element-citation>
      </ref>
      <ref id="R253322532192766">
        <element-citation publication-type="inproceedings">
          <person-group person-group-type="author">
            <name>
              <surname>Arthur</surname>
              <given-names>David</given-names>
            </name>
            <name>
              <surname>Vassilvitskii</surname>
              <given-names>Sergei</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <article-title>K-Means++ the Advantages of Careful Seeding</article-title>
          <source>ACM-SIAM symposium on Discrete algorithms</source>
          <year>2007</year>
          <conf-loc>New Orleans, LA, USA</conf-loc>
          <pub-id pub-id-type="doi">10.5555/1283383.1283494</pub-id>
          <conf-date>7–9 January 2007</conf-date>
        </element-citation>
      </ref>
      <ref id="R253322532192759">
        <element-citation publication-type="website">
          <person-group person-group-type="author">
            <name>
              <surname>ÖVDAT</surname>
              <given-names/>
            </name>
            <collab/>
          </person-group>
          <article-title>Die Graphenintegrations Plattform GIP. Das Referenzsystem der öffentlichen Hand für Verkehrsinfrastrukturdaten [The graph integration platform GIP. The public sector reference system for transport infrastructure data]</article-title>
          <source>Österreichisches Institut für Verkehrsdateninfrastruktur</source>
          <year>2022</year>
          <uri>https://www.gip.gv.at/</uri>
          <date-in-citation content-type="access-date">2024-11-09</date-in-citation>
        </element-citation>
      </ref>
      <ref id="R253322532192753">
        <element-citation publication-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Rehrl</surname>
              <given-names>Karl</given-names>
            </name>
            <name>
              <surname>Gröchenig</surname>
              <given-names>Simon</given-names>
            </name>
            <name>
              <surname>Wimmer</surname>
              <given-names>Michael</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor">
            <name>
              <surname>Mansourian</surname>
              <given-names>A</given-names>
            </name>
            <name>
              <surname>Pilesjö</surname>
              <given-names>P</given-names>
            </name>
            <name>
              <surname>Harrie</surname>
              <given-names>L</given-names>
            </name>
            <name>
              <surname>van Lammeren</surname>
              <given-names>R</given-names>
            </name>
          </person-group>
          <article-title>Optimization and Evaluation of a High-Performance Open-Source Map-Matching Implementation</article-title>
          <source>Geospatial Technologies for All</source>
          <publisher-name>Springer</publisher-name>
          <publisher-loc>Cham, Switzerland</publisher-loc>
          <year>2018</year>
          <pub-id pub-id-type="doi">10.1007/978-3-319-78208-9_13</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532365088">
        <element-citation publication-type="journal">
          <person-group person-group-type="author">
            <name>
              <surname>Blair</surname>
              <given-names>Julian</given-names>
            </name>
            <name>
              <surname>Lacy</surname>
              <given-names>Michael G</given-names>
            </name>
            <collab/>
          </person-group>
          <article-title>Statistics of Ordinal Variation</article-title>
          <source>Sociological Methods &amp; Research</source>
          <year>2000</year>
          <volume>28</volume>
          <issue>3</issue>
          <pub-id pub-id-type="doi">10.1177/0049124100028003001</pub-id>
        </element-citation>
      </ref>
      <ref id="R253322532329955">
        <element-citation publication-type="book">
          <person-group person-group-type="author">
            <name>
              <surname>Corder</surname>
              <given-names>Gregory W</given-names>
            </name>
            <name>
              <surname>Foreman</surname>
              <given-names>Dale I</given-names>
            </name>
            <collab/>
          </person-group>
          <person-group person-group-type="editor"/>
          <source>Nonparametric Statistics: A Step-by-Step Approach</source>
          <publisher-name>John Wiley &amp; Sons</publisher-name>
          <publisher-loc>New Jersey, USA</publisher-loc>
          <year>2014</year>
        </element-citation>
      </ref>
    </ref-list>
  </back>
</article>
