Markus Vorhauer,
"Application of FPGA to Machine Learning: Cluster Regression"
, 9-2022
Original Titel:
Application of FPGA to Machine Learning: Cluster Regression
Sprache des Titels:
Englisch
Original Kurzfassung:
In this thesis, a new approach for a Machine Learning algorithm named Cluster Regression is introduced, which is developed and designed to make an e?cient hardware realization on an FPGA. The idea is based on Random Forest regression, which is a very successful machine learning technique, but here the splitting during the learning process is ?nal and greedy. It could be an improvement if the elements chosen for model ?tting are re-selectable in terms of to which segment they belong. This new algorithm uses the unsupervised technique K-Means to gain additional information for a supervised regression task. Additionally, within the presented algorithm each region is described by a linear plane instead of the mean. Several simulations with di?erent datasets are done to ensure a good estimation perfor- mance of the digital hardware realization. The results of di?erent versions of the algorithm are compared to selected Machine Learning regressors. As the simulations show, depending on the dataset, the Cluster Regression algorithm outperforms Random Forest regression, Multiple Linear regression, and Gaussian Kernel regression. The hardware design is fully synthesisable on an FPGA and can be adjusted to the needed precision and hyper-parameter set. As a result, the presented work is useful in selected regression tasks. Furthermore, due to the parallelized hardware design, the learning time can be reduced by a multiple depend- ing on the compared sequential computer. Additionally, this calculation time is linearly proportional to the learning data size due to several hardware optimization techniques. The prediction time of new outputs is short and also linearly proportional to the size of the testing data. Concerning future research, this work could help to connect Machine Learning with digital hardware design even more since there is a big potential for the improvement of e?ciency and calculation speed.