Implementing quantum k-means clustering algorithm
Bui, Thi Bach Duong (2024-05-03)
Bui, Thi Bach Duong
T. B. D. Bui
03.05.2024
© 2024 Thi Bach Duong Bui. Ellei toisin mainita, uudelleenkäyttö on sallittu Creative Commons Attribution 4.0 International (CC-BY 4.0) -lisenssillä (https://creativecommons.org/licenses/by/4.0/). Uudelleenkäyttö on sallittua edellyttäen, että lähde mainitaan asianmukaisesti ja mahdolliset muutokset merkitään. Sellaisten osien käyttö tai jäljentäminen, jotka eivät ole tekijän tai tekijöiden omaisuutta, saattaa edellyttää lupaa suoraan asianomaisilta oikeudenhaltijoilta.
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:oulu-202405033107
https://urn.fi/URN:NBN:fi:oulu-202405033107
Tiivistelmä
The thesis reviewed a previous quantum approach for k-means clustering, identified its limitations, and proposed a method to address these shortcomings. Additionally, it introduced an algorithm to calculate multiple distances simultaneously, aiming to enhance the efficiency of the quantum k-means clustering program.
Through the thesis studies, the following conclusions were drawn: Firstly, employing quantum computers for computing Euclidean distances, a crucial task in the k-means clustering algorithm, is feasible. The Euclidean distances computed by quantum computers closely approximate those calculated by classical computers. Secondly, the thesis demonstrated the possibility of calculating multiple distances simultaneously in one quantum circuit, thus improving efficiency. Thirdly, the quantum k-means clustering algorithm, which utilizes this quantum circuit for Euclidean distance calculation, performs equally well with classical k-means clustering algorithms in small-scale problems. However, in large-scale problems, the quantum k-means clustering algorithm produces slightly less accurate results.
Regarding quantum advantage, theoretically, quantum computers would exponentially speed up the state preparation process for calculating quantum distance, as it requires only logarithmic qubits to represent the dataset. However, since classical calculation is still needed to compute the distance from the results obtained from the quantum computer, the quantum speedup for the whole task is not significant.
Through the thesis studies, the following conclusions were drawn: Firstly, employing quantum computers for computing Euclidean distances, a crucial task in the k-means clustering algorithm, is feasible. The Euclidean distances computed by quantum computers closely approximate those calculated by classical computers. Secondly, the thesis demonstrated the possibility of calculating multiple distances simultaneously in one quantum circuit, thus improving efficiency. Thirdly, the quantum k-means clustering algorithm, which utilizes this quantum circuit for Euclidean distance calculation, performs equally well with classical k-means clustering algorithms in small-scale problems. However, in large-scale problems, the quantum k-means clustering algorithm produces slightly less accurate results.
Regarding quantum advantage, theoretically, quantum computers would exponentially speed up the state preparation process for calculating quantum distance, as it requires only logarithmic qubits to represent the dataset. However, since classical calculation is still needed to compute the distance from the results obtained from the quantum computer, the quantum speedup for the whole task is not significant.
Kokoelmat
- Avoin saatavuus [37920]