2024
Adilova, Linara; Andriushchenko, Maksym; Kamp, Michael; Fischer, Asja; Jaggi, Martin
Layer-wise Linear Mode Connectivity Proceedings Article
In: International Conference on Learning Representations (ICLR), Curran Associates, Inc, 2024.
Abstract | Links | BibTeX | Tags: deep learning, layer-wise, linear mode connectivity
@inproceedings{adilova2024layerwise,
title = {Layer-wise Linear Mode Connectivity},
author = {Linara Adilova and Maksym Andriushchenko and Michael Kamp and Asja Fischer and Martin Jaggi},
url = {https://openreview.net/pdf?id=LfmZh91tDI},
year = {2024},
date = {2024-05-07},
urldate = {2024-05-07},
booktitle = {International Conference on Learning Representations (ICLR)},
publisher = {Curran Associates, Inc},
abstract = {Averaging neural network parameters is an intuitive method for fusing the knowledge of two independent models. It is most prominently used in federated learning. If models are averaged at the end of training, this can only lead to a well-performing model if the loss surface of interest is very particular, i.e., the loss in the exact middle between the two models needs to be sufficiently low. This is impossible to guarantee for the non-convex losses of state-of-the-art networks. For averaging models trained on vastly different datasets, it was proposed to average only the parameters of particular layers or combinations of layers, resulting in better performing models. To get a better understanding of the effect of layer-wise averaging, we analyse the performance of the models that result from averaging single layers, or groups of layers. Based on our empirical and theoretical investigation, we introduce a novel notion of layer-wise linear connectivity, and show that deep networks do not have layer-wise barriers between them. We additionally analyze layer-wise personalization averaging and conjecture that in this particular problem setup all partial aggregations result in approximately the same performance.},
keywords = {deep learning, layer-wise, linear mode connectivity},
pubstate = {published},
tppubtype = {inproceedings}
}
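A minimal sketch (not the authors' implementation) of the layer-wise averaging and barrier computation described in the abstract above, assuming models are given as dicts of NumPy arrays and a user-supplied, hypothetical evaluate_loss(params, data) function:

```python
import numpy as np

def interpolate_layers(params_a, params_b, layers, alpha=0.5):
    """Average only the named layers; all other layers are kept from model A."""
    mixed = dict(params_a)
    for name in layers:
        mixed[name] = (1 - alpha) * params_a[name] + alpha * params_b[name]
    return mixed

def layerwise_barrier(params_a, params_b, layers, data, evaluate_loss,
                      alphas=np.linspace(0.0, 1.0, 11)):
    """Barrier along the partial linear path: largest gap between the loss of the
    partially interpolated model and the linear interpolation of the end losses."""
    loss_a = evaluate_loss(params_a, data)
    loss_b = evaluate_loss(params_b, data)
    gaps = []
    for alpha in alphas:
        mixed = interpolate_layers(params_a, params_b, layers, alpha)
        gaps.append(evaluate_loss(mixed, data) - ((1 - alpha) * loss_a + alpha * loss_b))
    return max(gaps)
```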
Yang, Fan; Le Bodic, Pierre; Kamp, Michael; Boley, Mario
Orthogonal Gradient Boosting for Interpretable Additive Rule Ensembles Proceedings Article
In: Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS), 2024.
Abstract | BibTeX | Tags: complexity, explainability, interpretability, interpretable, machine learning, rule ensemble, rule mining, XAI
@inproceedings{yang2024orthogonal,
title = {Orthogonal Gradient Boosting for Interpretable Additive Rule Ensembles},
author = {Fan Yang and Pierre Le Bodic and Michael Kamp and Mario Boley},
year = {2024},
date = {2024-05-02},
urldate = {2024-05-02},
booktitle = {Proceedings of the 27th International Conference on Artificial Intelligence and Statistics (AISTATS)},
abstract = {Gradient boosting of prediction rules is an efficient approach to learn potentially interpretable yet accurate probabilistic models. However, actual interpretability requires to limit the number and size of the generated rules, and existing boosting variants are not designed for this purpose. Though corrective boosting refits all rule weights in each iteration to minimise prediction risk, the included rule conditions tend to be sub-optimal, because commonly used objective functions fail to anticipate this refitting. Here, we address this issue by a new objective function that measures the angle between the risk gradient vector and the projection of the condition output vector onto the orthogonal complement of the already selected conditions. This approach correctly approximates the ideal update of adding the risk gradient itself to the model and favours the inclusion of more general and thus shorter rules. As we demonstrate using a wide range of prediction tasks, this significantly improves the comprehensibility/accuracy trade-off of the fitted ensemble. Additionally, we show how objective values for related rule conditions can be computed incrementally to avoid any substantial computational overhead of the new method.},
keywords = {complexity, explainability, interpretability, interpretable, machine learning, rule ensemble, rule mining, XAI},
pubstate = {published},
tppubtype = {inproceedings}
}
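A sketch of the scoring idea from the abstract above, not the paper's exact objective: rank a candidate rule by the angle between the risk gradient and the part of the rule's output vector that is orthogonal to the already selected conditions. All names and the (gradient, output vector) representation are assumptions.

```python
import numpy as np

def orthogonal_boosting_score(g, q, Q=None, eps=1e-12):
    """Score a candidate rule by the cosine of the angle between the risk gradient g
    (one entry per training example) and the component of the candidate's output
    vector q orthogonal to the span of the selected condition outputs (columns of Q)."""
    if Q is not None and Q.shape[1] > 0:
        coef, *_ = np.linalg.lstsq(Q, q, rcond=None)   # projection of q onto span(Q)
        q_perp = q - Q @ coef                          # orthogonal-complement part
    else:
        q_perp = q
    return abs(g @ q_perp) / (np.linalg.norm(g) * np.linalg.norm(q_perp) + eps)
```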
2023
Adilova, Linara; Abourayya, Amr; Li, Jianning; Dada, Amin; Petzka, Henning; Egger, Jan; Kleesiek, Jens; Kamp, Michael
FAM: Relative Flatness Aware Minimization Proceedings Article
In: Proceedings of the ICML Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML), 2023.
Links | BibTeX | Tags: deep learning, flatness, generalization, machine learning, relative flatness, theory of deep learning
@inproceedings{adilova2023fam,
title = {FAM: Relative Flatness Aware Minimization},
author = {Linara Adilova and Amr Abourayya and Jianning Li and Amin Dada and Henning Petzka and Jan Egger and Jens Kleesiek and Michael Kamp},
url = {https://michaelkamp.org/wp-content/uploads/2023/06/fam_regularization.pdf},
year = {2023},
date = {2023-07-22},
urldate = {2023-07-22},
booktitle = {Proceedings of the ICML Workshop on Topology, Algebra, and Geometry in Machine Learning (TAG-ML)},
keywords = {deep learning, flatness, generalization, machine learning, relative flatness, theory of deep learning},
pubstate = {published},
tppubtype = {inproceedings}
}
Adilova, Linara; Kamp, Michael; Andrienko, Gennady; Andrienko, Natalia
Re-interpreting Rules Interpretability Journal Article
In: International Journal of Data Science and Analytics, 2023.
BibTeX | Tags: interpretable, machine learning, rule learning, XAI
@article{adilova2023reinterpreting,
title = {Re-interpreting Rules Interpretability},
author = {Linara Adilova and Michael Kamp and Gennady Andrienko and Natalia Andrienko},
year = {2023},
date = {2023-06-30},
urldate = {2023-06-30},
journal = {International Journal of Data Science and Analytics},
keywords = {interpretable, machine learning, rule learning, XAI},
pubstate = {published},
tppubtype = {article}
}
Kamp, Michael; Fischer, Jonas; Vreeken, Jilles
Federated Learning from Small Datasets Proceedings Article
In: International Conference on Learning Representations (ICLR), 2023.
Links | BibTeX | Tags: black-box, black-box parallelization, daisy, daisy-chaining, FedDC, federated learning, small, small datasets
@inproceedings{kamp2023federated,
title = {Federated Learning from Small Datasets},
author = {Michael Kamp and Jonas Fischer and Jilles Vreeken},
url = {https://michaelkamp.org/wp-content/uploads/2022/08/FederatedLearingSmallDatasets.pdf},
year = {2023},
date = {2023-05-01},
urldate = {2023-05-01},
booktitle = {International Conference on Learning Representations (ICLR)},
journal = {arXiv preprint arXiv:2110.03469},
keywords = {black-box, black-box parallelization, daisy, daisy-chaining, FedDC, federated learning, small, small datasets},
pubstate = {published},
tppubtype = {inproceedings}
}
Mian, Osman; Kaltenpoth, David; Kamp, Michael; Vreeken, Jilles
Nothing but Regrets - Privacy-Preserving Federated Causal Discovery Proceedings Article
In: International Conference on Artificial Intelligence and Statistics (AISTATS), 2023.
BibTeX | Tags: causal discovery, causality, explainable, federated, federated causal discovery, federated learning, interpretable
@inproceedings{mian2022nothing,
title = {Nothing but Regrets - Privacy-Preserving Federated Causal Discovery},
author = {Osman Mian and David Kaltenpoth and Michael Kamp and Jilles Vreeken},
year = {2023},
date = {2023-04-25},
urldate = {2023-04-25},
booktitle = {International Conference on Artificial Intelligence and Statistics (AISTATS)},
keywords = {causal discovery, causality, explainable, federated, federated causal discovery, federated learning, interpretable},
pubstate = {published},
tppubtype = {inproceedings}
}
Mian, Osman; Kamp, Michael; Vreeken, Jilles
Information-Theoretic Causal Discovery and Intervention Detection over Multiple Environments Proceedings Article
In: Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023.
BibTeX | Tags: causal discovery, causality, federated, federated causal discovery, federated learning, intervention
@inproceedings{mian2023informationb,
title = {Information-Theoretic Causal Discovery and Intervention Detection over Multiple Environments},
author = {Osman Mian and Michael Kamp and Jilles Vreeken},
year = {2023},
date = {2023-02-07},
urldate = {2023-02-07},
booktitle = {Proceedings of the AAAI Conference on Artificial Intelligence (AAAI)},
keywords = {causal discovery, causality, federated, federated causal discovery, federated learning, intervention},
pubstate = {published},
tppubtype = {inproceedings}
}
Li, Jianning; Ferreira, André; Puladi, Behrus; Alves, Victor; Kamp, Michael; Kim, Moon; Nensa, Felix; Kleesiek, Jens; Ahmadi, Seyed-Ahmad; Egger, Jan
Open-source skull reconstruction with MONAI Journal Article
In: SoftwareX, vol. 23, pp. 101432, 2023.
BibTeX | Tags:
@article{li2023open,
title = {Open-source skull reconstruction with MONAI},
author = {Jianning Li and André Ferreira and Behrus Puladi and Victor Alves and Michael Kamp and Moon Kim and Felix Nensa and Jens Kleesiek and Seyed-Ahmad Ahmadi and Jan Egger},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
journal = {SoftwareX},
volume = {23},
pages = {101432},
publisher = {Elsevier},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Adilova, Linara; Chen, Siming; Kamp, Michael
Informed Novelty Detection in Sequential Data by Per-Cluster Modeling Proceedings Article
In: ICML workshop on Artificial Intelligence & Human Computer Interaction, 2023.
@inproceedings{adilova2023informed,
title = {Informed Novelty Detection in Sequential Data by Per-Cluster Modeling},
author = {Linara Adilova and Siming Chen and Michael Kamp},
url = {https://michaelkamp.org/wp-content/uploads/2023/09/Informed_Novelty_Detection_in_Sequential_Data_by_Per_Cluster_Modeling.pdf},
year = {2023},
date = {2023-01-01},
urldate = {2023-01-01},
booktitle = {ICML workshop on Artificial Intelligence & Human Computer Interaction},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2022
Wang, Junhong; Li, Yun; Zhou, Zhaoyu; Wang, Chengshun; Hou, Yijie; Zhang, Li; Xue, Xiangyang; Kamp, Michael; Zhang, Xiaolong; Chen, Siming
When, Where and How does it fail? A Spatial-temporal Visual Analytics Approach for Interpretable Object Detection in Autonomous Driving Journal Article
In: IEEE Transactions on Visualization and Computer Graphics, 2022.
BibTeX | Tags:
@article{wang2022and,
title = {When, Where and How does it fail? A Spatial-temporal Visual Analytics Approach for Interpretable Object Detection in Autonomous Driving},
author = {Junhong Wang and Yun Li and Zhaoyu Zhou and Chengshun Wang and Yijie Hou and Li Zhang and Xiangyang Xue and Michael Kamp and Xiaolong Zhang and Siming Chen},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
journal = {IEEE Transactions on Visualization and Computer Graphics},
publisher = {IEEE},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Mian, Osman; Kaltenpoth, David; Kamp, Michael
Regret-based Federated Causal Discovery Proceedings Article
In: The KDD'22 Workshop on Causal Discovery, pp. 61–69, PMLR 2022.
BibTeX | Tags:
@inproceedings{mian2022regret,
title = {Regret-based Federated Causal Discovery},
author = {Osman Mian and David Kaltenpoth and Michael Kamp},
year = {2022},
date = {2022-01-01},
urldate = {2022-01-01},
booktitle = {The KDD'22 Workshop on Causal Discovery},
pages = {61--69},
organization = {PMLR},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
2021
Petzka, Henning; Kamp, Michael; Adilova, Linara; Sminchisescu, Cristian; Boley, Mario
Relative Flatness and Generalization Proceedings Article
In: Advances in Neural Information Processing Systems, Curran Associates, Inc., 2021.
Abstract | BibTeX | Tags: deep learning, flatness, generalization, Hessian, learning theory, relative flatness, theory of deep learning
@inproceedings{petzka2021relative,
title = {Relative Flatness and Generalization},
author = {Henning Petzka and Michael Kamp and Linara Adilova and Cristian Sminchisescu and Mario Boley},
year = {2021},
date = {2021-12-07},
urldate = {2021-12-07},
booktitle = {Advances in Neural Information Processing Systems},
publisher = {Curran Associates, Inc.},
abstract = {Flatness of the loss curve is conjectured to be connected to the generalization ability of machine learning models, in particular neural networks. While it has been empirically observed that flatness measures consistently correlate strongly with generalization, it is still an open theoretical problem why and under which circumstances flatness is connected to generalization, in particular in light of reparameterizations that change certain flatness measures but leave generalization unchanged. We investigate the connection between flatness and generalization by relating it to the interpolation from representative data, deriving notions of representativeness, and feature robustness. The notions allow us to rigorously connect flatness and generalization and to identify conditions under which the connection holds. Moreover, they give rise to a novel, but natural relative flatness measure that correlates strongly with generalization, simplifies to ridge regression for ordinary least squares, and solves the reparameterization issue.},
keywords = {deep learning, flatness, generalization, Hessian, learning theory, relative flatness, theory of deep learning},
pubstate = {published},
tppubtype = {inproceedings}
}
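The relative flatness measure itself is defined in the paper; the following is only a simplified single-layer proxy (squared weight norm times a Hutchinson estimate of the Hessian trace), with grad_fn a hypothetical function returning the loss gradient with respect to that layer's flattened weights:

```python
import numpy as np

def relative_flatness_proxy(w, grad_fn, num_probes=10, eps=1e-3, seed=0):
    """Simplified proxy: ||w||^2 times a Hutchinson trace estimate of the Hessian,
    with Hessian-vector products approximated by finite differences of grad_fn."""
    rng = np.random.default_rng(seed)
    w = np.asarray(w, dtype=float)
    trace_est = 0.0
    for _ in range(num_probes):
        v = rng.choice([-1.0, 1.0], size=w.shape)                   # Rademacher probe
        hv = (grad_fn(w + eps * v) - grad_fn(w - eps * v)) / (2 * eps)
        trace_est += float(np.dot(v.ravel(), np.ravel(hv)))
    trace_est /= num_probes
    return float(np.dot(w.ravel(), w.ravel())) * trace_est
```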
Linsner, Florian; Adilova, Linara; Däubener, Sina; Kamp, Michael; Fischer, Asja
Approaches to Uncertainty Quantification in Federated Deep Learning Workshop
Machine Learning and Principles and Practice of Knowledge Discovery in Databases: International Workshops of ECML PKDD 2021, vol. 2, Springer, 2021.
Links | BibTeX | Tags: federated learning, uncertainty
@workshop{linsner2021uncertainty,
title = {Approaches to Uncertainty Quantification in Federated Deep Learning},
author = {Florian Linsner and Linara Adilova and Sina Däubener and Michael Kamp and Asja Fischer},
url = {https://michaelkamp.org/wp-content/uploads/2022/04/federatedUncertainty.pdf},
year = {2021},
date = {2021-09-17},
urldate = {2021-09-17},
booktitle = {Machine Learning and Principles and Practice of Knowledge Discovery in Databases: International Workshops of ECML PKDD 2021},
issuetitle = {Workshop on Parallel, Distributed, and Federated Learning},
volume = {2},
pages = {128-145},
publisher = {Springer},
keywords = {federated learning, uncertainty},
pubstate = {published},
tppubtype = {workshop}
}
Li, Xiaoxiao; Jiang, Meirui; Zhang, Xiaofei; Kamp, Michael; Dou, Qi
FedBN: Federated Learning on Non-IID Features via Local Batch Normalization Proceedings Article
In: Proceedings of the 9th International Conference on Learning Representations (ICLR), 2021.
Abstract | Links | BibTeX | Tags: batch normalization, black-box parallelization, deep learning, federated learning
@inproceedings{li2021fedbn,
title = {FedBN: Federated Learning on Non-IID Features via Local Batch Normalization},
author = {Xiaoxiao Li and Meirui Jiang and Xiaofei Zhang and Michael Kamp and Qi Dou},
url = {https://michaelkamp.org/wp-content/uploads/2021/05/fedbn_federated_learning_on_non_iid_features_via_local_batch_normalization.pdf
https://michaelkamp.org/wp-content/uploads/2021/05/FedBN_appendix.pdf},
year = {2021},
date = {2021-05-03},
urldate = {2021-05-03},
booktitle = {Proceedings of the 9th International Conference on Learning Representations (ICLR)},
abstract = {The emerging paradigm of federated learning (FL) strives to enable collaborative training of deep models on the network edge without centrally aggregating raw data and hence improving data privacy. In most cases, the assumption of independent and identically distributed samples across local clients does not hold for federated learning setups. Under this setting, neural network training performance may vary significantly according to the data distribution and even hurt training convergence. Most of the previous work has focused on a difference in the distribution of labels or client shifts. Unlike those settings, we address an important problem of FL, e.g., different scanners/sensors in medical imaging, different scenery distribution in autonomous driving (highway vs. city), where local clients store examples with different distributions compared to other clients, which we denote as feature shift non-iid. In this work, we propose an effective method that uses local batch normalization to alleviate the feature shift before averaging models. The resulting scheme, called FedBN, outperforms both classical FedAvg, as well as the state-of-the-art for non-iid data (FedProx) on our extensive experiments. These empirical results are supported by a convergence analysis that shows in a simplified setting that FedBN has a faster convergence rate than FedAvg. Code is available at https://github.com/med-air/FedBN.},
keywords = {batch normalization, black-box parallelization, deep learning, federated learning},
pubstate = {published},
tppubtype = {inproceedings}
}
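A minimal sketch of the aggregation rule FedBN builds on, assuming each client model is a dict of NumPy arrays; recognizing batch-norm parameters by name is an assumption for illustration, not part of the paper:

```python
import numpy as np

def fedbn_average(client_params, is_bn=lambda name: "bn" in name or "norm" in name):
    """Average all parameters across clients except those recognized as batch-norm
    parameters, which remain local; returns one updated parameter dict per client."""
    shared = {
        name: np.mean([params[name] for params in client_params], axis=0)
        for name in client_params[0]
        if not is_bn(name)
    }
    updated = []
    for params in client_params:
        new_params = dict(params)   # keeps the client's own BN scale/shift/statistics
        new_params.update(shared)   # overwrites everything else with the global average
        updated.append(new_params)
    return updated
```

Keeping the normalization statistics local is what lets each client adapt to its own feature distribution while still sharing the rest of the network.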
2020
Heppe, Lukas; Kamp, Michael; Adilova, Linara; Piatkowski, Nico; Heinrich, Danny; Morik, Katharina
Resource-Constrained On-Device Learning by Dynamic Averaging Workshop
Proceedings of the Workshop on Parallel, Distributed, and Federated Learning (PDFL) at ECMLPKDD, 2020.
Abstract | Links | BibTeX | Tags: black-box parallelization, distributed learning, edge computing, embedded, exponential family, FPGA, resource-efficient
@workshop{heppe2020resource,
title = {Resource-Constrained On-Device Learning by Dynamic Averaging},
author = {Lukas Heppe and Michael Kamp and Linara Adilova and Nico Piatkowski and Danny Heinrich and Katharina Morik},
url = {https://michaelkamp.org/wp-content/uploads/2020/10/Resource_Constrained_Federated_Learning-1.pdf},
year = {2020},
date = {2020-09-14},
urldate = {2020-09-14},
booktitle = {Proceedings of the Workshop on Parallel, Distributed, and Federated Learning (PDFL) at ECMLPKDD},
abstract = {The communication between data-generating devices is partially responsible for a growing portion of the world's power consumption. Thus reducing communication is vital, both, from an economical and an ecological perspective. For machine learning, on-device learning avoids sending raw data, which can reduce communication substantially. Furthermore, not centralizing the data protects privacy-sensitive data. However, most learning algorithms require hardware with high computation power and thus high energy consumption. In contrast, ultra-low-power processors, like FPGAs or micro-controllers, allow for energy-efficient learning of local models. Combined with communication-efficient distributed learning strategies, this reduces the overall energy consumption and enables applications that were yet impossible due to limited energy on local devices. The major challenge is then, that the low-power processors typically only have integer processing capabilities. This paper investigates an approach to communication-efficient on-device learning of integer exponential families that can be executed on low-power processors, is privacy-preserving, and effectively minimizes communication. The empirical evaluation shows that the approach can reach a model quality comparable to a centrally learned regular model with an order of magnitude less communication. Comparing the overall energy consumption, this reduces the required energy for solving the machine learning task by a significant amount.},
keywords = {black-box parallelization, distributed learning, edge computing, embedded, exponential family, FPGA, resource-efficient},
pubstate = {published},
tppubtype = {workshop}
}
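An integer-only sketch of the two ingredients described above, a device-local divergence check that triggers synchronization and an aggregation that keeps parameters integer. This illustrates the protocol structure under assumed parameter-vector representations, not the paper's implementation:

```python
import numpy as np

def local_violation(w, reference, delta):
    """Device-side check of the dynamic-averaging condition ||w - r||^2 <= delta;
    needs only integer arithmetic, so it runs on FPGAs or micro-controllers."""
    diff = w.astype(np.int64) - reference.astype(np.int64)
    return int(np.dot(diff, diff)) > delta

def integer_average(models):
    """Aggregate integer parameter vectors; rounding keeps the result integer so it
    can be shipped back to integer-only devices."""
    mean = np.mean([m.astype(np.int64) for m in models], axis=0)
    return np.rint(mean).astype(models[0].dtype)
```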
Petzka, Henning; Adilova, Linara; Kamp, Michael; Sminchisescu, Cristian
Feature-Robustness, Flatness and Generalization Error for Deep Neural Networks Workshop
2020.
Links | BibTeX | Tags: deep learning, flatness, generalization, learning theory, loss surface, neural networks, robustness
@workshop{petzka2020feature,
title = {Feature-Robustness, Flatness and Generalization Error for Deep Neural Networks},
author = {Henning Petzka and Linara Adilova and Michael Kamp and Cristian Sminchisescu},
url = {http://michaelkamp.org/wp-content/uploads/2020/01/flatnessFeatureRobustnessGeneralization.pdf},
year = {2020},
date = {2020-01-01},
urldate = {2020-01-01},
journal = {arXiv preprint arXiv:2001.00939},
keywords = {deep learning, flatness, generalization, learning theory, loss surface, neural networks, robustness},
pubstate = {published},
tppubtype = {workshop}
}
Welke, Pascal; Seiffarth, Florian; Kamp, Michael; Wrobel, Stefan
HOPS: Probabilistic Subtree Mining for Small and Large Graphs Proceedings Article
In: Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pp. 1275–1284, Association for Computing Machinery, Virtual Event, CA, USA, 2020, ISBN: 9781450379984.
Abstract | Links | BibTeX | Tags:
@inproceedings{10.1145/3394486.3403180,
title = {HOPS: Probabilistic Subtree Mining for Small and Large Graphs},
author = {Pascal Welke and Florian Seiffarth and Michael Kamp and Stefan Wrobel},
url = {https://doi.org/10.1145/3394486.3403180},
doi = {10.1145/3394486.3403180},
isbn = {9781450379984},
year = {2020},
date = {2020-01-01},
urldate = {2020-01-01},
booktitle = {Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining},
pages = {1275--1284},
publisher = {Association for Computing Machinery},
address = {Virtual Event, CA, USA},
series = {KDD '20},
abstract = {Frequent subgraph mining, i.e., the identification of relevant patterns in graph databases, is a well-known data mining problem with high practical relevance, since next to summarizing the data, the resulting patterns can also be used to define powerful domain-specific similarity functions for prediction. In recent years, significant progress has been made towards subgraph mining algorithms that scale to complex graphs by focusing on tree patterns and probabilistically allowing a small amount of incompleteness in the result. Nonetheless, the complexity of the pattern matching component used for deciding subtree isomorphism on arbitrary graphs has significantly limited the scalability of existing approaches. In this paper, we adapt sampling techniques from mathematical combinatorics to the problem of probabilistic subtree mining in arbitrary databases of many small to medium-size graphs or a single large graph. By restricting on tree patterns, we provide an algorithm that approximately counts or decides subtree isomorphism for arbitrary transaction graphs in sub-linear time with one-sided error. Our empirical evaluation on a range of benchmark graph datasets shows that the novel algorithm substantially outperforms state-of-the-art approaches both in the task of approximate counting of embeddings in single large graphs and in probabilistic frequent subtree mining in large databases of small to medium sized graphs.},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
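HOPS handles general rooted tree patterns; the sketch below restricts to labelled path patterns purely to illustrate the sampling idea, and all data-structure choices (adjacency dict, label dict) are assumptions:

```python
import random

def estimate_path_embeddings(graph, labels, pattern, samples=1000, rng=random):
    """Sequential importance sampling: each attempt embeds the labelled path pattern
    vertex by vertex; a dead end contributes zero, a success contributes the product
    of the branching choices, which is an unbiased estimate of the number of
    injective embeddings (one-sided error: a positive value proves existence)."""
    def one_attempt():
        starts = [v for v in graph if labels[v] == pattern[0]]
        if not starts:
            return 0.0
        weight, current = float(len(starts)), rng.choice(starts)
        used = {current}
        for lab in pattern[1:]:
            nxt = [u for u in graph[current] if labels[u] == lab and u not in used]
            if not nxt:
                return 0.0
            weight *= len(nxt)
            current = rng.choice(nxt)
            used.add(current)
        return weight
    return sum(one_attempt() for _ in range(samples)) / samples
```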
2019
Kamp, Michael
Black-Box Parallelization for Machine Learning PhD Thesis
Universitäts- und Landesbibliothek Bonn, 2019.
Abstract | Links | BibTeX | Tags: averaging, black-box, communication-efficient, convex optimization, deep learning, distributed, dynamic averaging, federated, learning theory, machine learning, parallelization, privacy, radon machine
@phdthesis{kamp2019black,
title = {Black-Box Parallelization for Machine Learning},
author = {Michael Kamp},
url = {https://d-nb.info/1200020057/34},
year = {2019},
date = {2019-01-01},
urldate = {2019-01-01},
school = {Universitäts- und Landesbibliothek Bonn},
abstract = {The landscape of machine learning applications is changing rapidly: large centralized datasets are replaced by high volume, high velocity data streams generated by a vast number of geographically distributed, loosely connected devices, such as mobile phones, smart sensors, autonomous vehicles or industrial machines. Current learning approaches centralize the data and process it in parallel in a cluster or computing center. This has three major disadvantages: (i) it does not scale well with the number of data-generating devices since their growth exceeds that of computing centers, (ii) the communication costs for centralizing the data are prohibitive in many applications, and (iii) it requires sharing potentially privacy-sensitive data. Pushing computation towards the data-generating devices alleviates these problems and allows to employ their otherwise unused computing power. However, current parallel learning approaches are designed for tightly integrated systems with low latency and high bandwidth, not for loosely connected distributed devices. Therefore, I propose a new paradigm for parallelization that treats the learning algorithm as a black box, training local models on distributed devices and aggregating them into a single strong one. Since this requires only exchanging models instead of actual data, the approach is highly scalable, communication-efficient, and privacy-preserving.
Following this paradigm, this thesis develops black-box parallelizations for two broad classes of learning algorithms. One approach can be applied to incremental learning algorithms, i.e., those that improve a model in iterations. Based on the utility of aggregations it schedules communication dynamically, adapting it to the hardness of the learning problem. In practice, this leads to a reduction in communication by orders of magnitude. It is analyzed for (i) online learning, in particular in the context of in-stream learning, which allows to guarantee optimal regret and for (ii) batch learning based on empirical risk minimization where optimal convergence can be guaranteed. The other approach is applicable to non-incremental algorithms as well. It uses a novel aggregation method based on the Radon point that allows to achieve provably high model quality with only a single aggregation. This is achieved in polylogarithmic runtime on quasi-polynomially many processors. This relates parallel machine learning to Nick's class of parallel decision problems and is a step towards answering a fundamental open problem about the abilities and limitations of efficient parallel learning algorithms. An empirical study on real distributed systems confirms the potential of the approaches in realistic application scenarios.},
keywords = {averaging, black-box, communication-efficient, convex optimization, deep learning, distributed, dynamic averaging, federated, learning theory, machine learning, parallelization, privacy, radon machine},
pubstate = {published},
tppubtype = {phdthesis}
}
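The common structure behind both approaches developed in the thesis, sketched in a few lines under assumed names: the learner is treated as a black box run locally, and only models are exchanged and aggregated (plain averaging here; a Radon-point aggregator is sketched under the 2017 entry "Effective Parallelisation for Machine Learning" below).

```python
import numpy as np

def black_box_parallel(train_fn, local_datasets,
                       aggregate=lambda models: np.mean(models, axis=0)):
    """Black-box parallelization in its simplest form: any learning algorithm
    train_fn is run on each device's data, and only the resulting parameter
    vectors are exchanged and aggregated; raw data never leaves the devices."""
    local_models = [train_fn(data) for data in local_datasets]   # runs on the devices
    return aggregate(local_models)
```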
Adilova, Linara; Natious, Livin; Chen, Siming; Thonnard, Olivier; Kamp, Michael
System Misuse Detection via Informed Behavior Clustering and Modeling Workshop
2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W), IEEE 2019.
Links | BibTeX | Tags: anomaly detection, cybersecurity, DiSIEM, security, user behavior modelling, visualization
@workshop{adilova2019system,
title = {System Misuse Detection via Informed Behavior Clustering and Modeling},
author = {Linara Adilova and Livin Natious and Siming Chen and Olivier Thonnard and Michael Kamp},
url = {https://arxiv.org/pdf/1907.00874},
year = {2019},
date = {2019-01-01},
urldate = {2019-01-01},
booktitle = {2019 49th Annual IEEE/IFIP International Conference on Dependable Systems and Networks Workshops (DSN-W)},
pages = {15--23},
organization = {IEEE},
keywords = {anomaly detection, cybersecurity, DiSIEM, security, user behavior modelling, visualization},
pubstate = {published},
tppubtype = {workshop}
}
Petzka, Henning; Adilova, Linara; Kamp, Michael; Sminchisescu, Cristian
A Reparameterization-Invariant Flatness Measure for Deep Neural Networks Workshop
Science meets Engineering of Deep Learning workshop at NeurIPS, 2019.
Links | BibTeX | Tags: deep learning, flatness, generalization, learning theory, loss surface, neural networks, robustness
@workshop{petzka2019reparameterization,
title = {A Reparameterization-Invariant Flatness Measure for Deep Neural Networks},
author = {Henning Petzka and Linara Adilova and Michael Kamp and Cristian Sminchisescu},
url = {https://arxiv.org/pdf/1912.00058},
year = {2019},
date = {2019-01-01},
urldate = {2019-01-01},
booktitle = {Science meets Engineering of Deep Learning workshop at NeurIPS},
keywords = {deep learning, flatness, generalization, learning theory, loss surface, neural networks, robustness},
pubstate = {published},
tppubtype = {workshop}
}
Adilova, Linara; Rosenzweig, Julia; Kamp, Michael
Information Theoretic Perspective of Federated Learning Workshop
NeurIPS Workshop on Information Theory and Machine Learning, 2019.
@workshop{adilova2019information,
title = {Information Theoretic Perspective of Federated Learning},
author = {Linara Adilova and Julia Rosenzweig and Michael Kamp},
url = {https://arxiv.org/pdf/1911.07652},
year = {2019},
date = {2019-01-01},
urldate = {2019-01-01},
booktitle = {NeurIPS Workshop on Information Theory and Machine Learning},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
2018
Giesselbach, Sven; Ullrich, Katrin; Kamp, Michael; Paurat, Daniel; Gärtner, Thomas
Corresponding Projections for Orphan Screening Workshop
Proceedings of the ML4H workshop at NeurIPS, 2018.
Links | BibTeX | Tags: corresponding projections, transfer learning, unsupervised
@workshop{giesselbach2018corresponding,
title = {Corresponding Projections for Orphan Screening},
author = {Sven Giesselbach and Katrin Ullrich and Michael Kamp and Daniel Paurat and Thomas Gärtner},
url = {http://michaelkamp.org/wp-content/uploads/2018/12/cpNIPS.pdf},
year = {2018},
date = {2018-12-08},
urldate = {2018-12-08},
booktitle = {Proceedings of the ML4H workshop at NeurIPS},
keywords = {corresponding projections, transfer learning, unsupervised},
pubstate = {published},
tppubtype = {workshop}
}
Nguyen, Phong H.; Chen, Siming; Andrienko, Natalia; Kamp, Michael; Adilova, Linara; Andrienko, Gennady; Thonnard, Olivier; Bessani, Alysson; Turkay, Cagatay
Designing Visualisation Enhancements for SIEM Systems Workshop
15th IEEE Symposium on Visualization for Cyber Security – VizSec, 2018.
Links | BibTeX | Tags: DiSIEM, SIEM, visual analytics, visualization
@workshop{phong2018designing,
title = {Designing Visualisation Enhancements for SIEM Systems},
author = {Phong H. Nguyen and Siming Chen and Natalia Andrienko and Michael Kamp and Linara Adilova and Gennady Andrienko and Olivier Thonnard and Alysson Bessani and Cagatay Turkay},
url = {http://michaelkamp.org/vizsec2018-poster-designing-visualisation-enhancements-for-siem-systems/},
year = {2018},
date = {2018-10-22},
urldate = {2018-10-22},
booktitle = {15th IEEE Symposium on Visualization for Cyber Security – VizSec},
keywords = {DiSIEM, SIEM, visual analytics, visualization},
pubstate = {published},
tppubtype = {workshop}
}
Kamp, Michael; Adilova, Linara; Sicking, Joachim; Hüger, Fabian; Schlicht, Peter; Wirtz, Tim; Wrobel, Stefan
Efficient Decentralized Deep Learning by Dynamic Model Averaging Proceedings Article
In: Machine Learning and Knowledge Discovery in Databases, Springer, 2018.
Abstract | Links | BibTeX | Tags: decentralized, deep learning, federated learning
@inproceedings{kamp2018efficient,
title = {Efficient Decentralized Deep Learning by Dynamic Model Averaging},
author = {Michael Kamp and Linara Adilova and Joachim Sicking and Fabian Hüger and Peter Schlicht and Tim Wirtz and Stefan Wrobel},
url = {http://michaelkamp.org/wp-content/uploads/2018/07/commEffDeepLearning_extended.pdf},
year = {2018},
date = {2018-09-14},
urldate = {2018-09-14},
booktitle = {Machine Learning and Knowledge Discovery in Databases},
publisher = {Springer},
abstract = {We propose an efficient protocol for decentralized training of deep neural networks from distributed data sources. The proposed protocol allows to handle different phases of model training equally well and to quickly adapt to concept drifts. This leads to a reduction of communication by an order of magnitude compared to periodically communicating state-of-the-art approaches. Moreover, we derive a communication bound that scales well with the hardness of the serialized learning problem. The reduction in communication comes at almost no cost, as the predictive performance remains virtually unchanged. Indeed, the proposed protocol retains loss bounds of periodically averaging schemes. An extensive empirical evaluation validates major improvement of the trade-off between model performance and communication which could be beneficial for numerous decentralized learning applications, such as autonomous driving, or voice recognition and image classification on mobile phones.},
keywords = {decentralized, deep learning, federated learning},
pubstate = {published},
tppubtype = {inproceedings}
}
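A centralized-check sketch of the dynamic averaging rule from the abstract above (the paper uses a distributed resolution protocol with local conditions, omitted here); models are assumed to be flat NumPy parameter vectors:

```python
import numpy as np

def dynamic_averaging_check(models, reference, delta):
    """One synchronization check: the models are only averaged (one communication
    round) if their mean squared divergence from the last synchronized reference
    exceeds delta; otherwise no communication happens at all."""
    divergence = np.mean([np.sum((w - reference) ** 2) for w in models])
    if divergence <= delta:
        return models, reference, False
    average = np.mean(models, axis=0)
    return [average.copy() for _ in models], average, True
```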
2017
Ernis, Gunar; Kamp, Michael
Machine Learning für die smarte Produktion Journal Article
In: VDMA-Nachrichten, pp. 36-37, 2017.
Links | BibTeX | Tags: industry 4.0, machine learning, smart production
@article{kamp2017machine,
title = {Machine Learning für die smarte Produktion},
author = {Gunar Ernis and Michael Kamp},
editor = {Rebecca Pini},
url = {https://sud.vdma.org/documents/15012668/22571546/VDMA-Nachrichten%20Smart%20Data%2011-2017_1513086481204.pdf/c5767569-504e-4f64-8dba-8e7bdd06c18e},
year = {2017},
date = {2017-11-01},
issuetitle = {Smart Data - aus Daten Gold machen},
journal = {VDMA-Nachrichten},
pages = {36-37},
publisher = {Verband Deutscher Maschinen- und Anlagenbau e.V.},
keywords = {industry 4.0, machine learning, smart production},
pubstate = {published},
tppubtype = {article}
}
Flouris, Ioannis; Giatrakos, Nikos; Deligiannakis, Antonios; Garofalakis, Minos; Kamp, Michael; Mock, Michael
Issues in Complex Event Processing: Status and Prospects in the Big Data Era Journal Article
In: Journal of Systems and Software, 2017.
BibTeX | Tags:
@article{flouris2016issues,
title = {Issues in Complex Event Processing: Status and Prospects in the Big Data Era},
author = {Ioannis Flouris and Nikos Giatrakos and Antonios Deligiannakis and Minos Garofalakis and Michael Kamp and Michael Mock},
year = {2017},
date = {2017-01-01},
urldate = {2017-01-01},
journal = {Journal of Systems and Software},
publisher = {Elsevier},
keywords = {},
pubstate = {published},
tppubtype = {article}
}
Kamp, Michael; Boley, Mario; Missura, Olana; Gärtner, Thomas
Effective Parallelisation for Machine Learning Proceedings Article
In: Advances in Neural Information Processing Systems, pp. 6480–6491, 2017.
Abstract | Links | BibTeX | Tags: decentralized, distributed, machine learning, parallelization, radon
@inproceedings{kamp2017effective,
title = {Effective Parallelisation for Machine Learning},
author = {Michael Kamp and Mario Boley and Olana Missura and Thomas Gärtner},
url = {http://papers.nips.cc/paper/7226-effective-parallelisation-for-machine-learning.pdf},
year = {2017},
date = {2017-01-01},
urldate = {2017-01-01},
booktitle = {Advances in Neural Information Processing Systems},
pages = {6480--6491},
abstract = {We present a novel parallelisation scheme that simplifies the adaptation of learning algorithms to growing amounts of data as well as growing needs for accurate and confident predictions in critical applications. In contrast to other parallelisation techniques, it can be applied to a broad class of learning algorithms without further mathematical derivations and without writing dedicated code, while at the same time maintaining theoretical performance guarantees. Moreover, our parallelisation scheme is able to reduce the runtime of many learning algorithms to polylogarithmic time on quasi-polynomially many processing units. This is a significant step towards a general answer to an open question on efficient parallelisation of machine learning algorithms in the sense of Nick's Class (NC). The cost of this parallelisation is in the form of a larger sample complexity. Our empirical study confirms the potential of our parallelisation scheme with fixed numbers of processors and instances in realistic application scenarios.},
keywords = {decentralized, distributed, machine learning, parallelization, radon},
pubstate = {published},
tppubtype = {inproceedings}
}
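A sketch of the aggregation primitive behind this scheme, assuming the models are parameter vectors in R^d and at least d+2 of them are available; the names are illustrative and the hierarchical (iterated) application that gives the polylogarithmic runtime is omitted:

```python
import numpy as np

def radon_point(points):
    """Radon point of n >= d+2 points in R^d: solve sum_i lam_i x_i = 0,
    sum_i lam_i = 0 for a non-trivial lam, split the points by the sign of lam,
    and return the common point of the two convex hulls."""
    X = np.asarray(points, dtype=float)          # shape (n, d) with n >= d + 2
    n = X.shape[0]
    A = np.vstack([X.T, np.ones((1, n))])        # (d+1) x n homogeneous system
    lam = np.linalg.svd(A)[2][-1]                # a null-space vector of A
    pos = lam > 0
    return (lam[pos] @ X[pos]) / lam[pos].sum()
```

Because the Radon point lies in the convex hulls of both parts of the partition, it is a more outlier-robust aggregate than the plain average, which is what allows the quality guarantee after a single aggregation step.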
Ullrich, Katrin; Kamp, Michael; Gärtner, Thomas; Vogt, Martin; Wrobel, Stefan
Co-regularised support vector regression Proceedings Article
In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 338–354, Springer 2017.
Links | BibTeX | Tags: co-regularization, transfer learning, unsupervised
@inproceedings{ullrich2017co,
title = {Co-regularised support vector regression},
author = {Katrin Ullrich and Michael Kamp and Thomas Gärtner and Martin Vogt and Stefan Wrobel},
url = {http://michaelkamp.org/mk_v1/wp-content/uploads/2018/05/CoRegSVR.pdf},
year = {2017},
date = {2017-01-01},
urldate = {2017-01-01},
booktitle = {Joint European Conference on Machine Learning and Knowledge Discovery in Databases},
pages = {338--354},
organization = {Springer},
keywords = {co-regularization, transfer learning, unsupervised},
pubstate = {published},
tppubtype = {inproceedings}
}
2016
Kamp, Michael; Bothe, Sebastian; Boley, Mario; Mock, Michael
Communication-Efficient Distributed Online Learning with Kernels Proceedings Article
In: Frasconi, Paolo; Landwehr, Niels; Manco, Giuseppe; Vreeken, Jilles (Ed.): Machine Learning and Knowledge Discovery in Databases, pp. 805–819, Springer International Publishing, 2016.
Abstract | Links | BibTeX | Tags: communication-efficient, distributed, dynamic averaging, federated learning, kernel methods, parallelization
@inproceedings{kamp2016communication,
title = {Communication-Efficient Distributed Online Learning with Kernels},
author = {Michael Kamp and Sebastian Bothe and Mario Boley and Michael Mock},
editor = {Paolo Frasconi and Niels Landwehr and Giuseppe Manco and Jilles Vreeken},
url = {http://michaelkamp.org/wp-content/uploads/2020/03/Paper467.pdf},
year = {2016},
date = {2016-09-16},
urldate = {2016-09-16},
booktitle = {Machine Learning and Knowledge Discovery in Databases},
pages = {805--819},
publisher = {Springer International Publishing},
abstract = {We propose an efficient distributed online learning protocol for low-latency real-time services. It extends a previously presented protocol to kernelized online learners that represent their models by a support vector expansion. While such learners often achieve higher predictive performance than their linear counterparts, communicating the support vector expansions becomes inefficient for large numbers of support vectors. The proposed extension allows for a larger class of online learning algorithms – including those alleviating the problem above through model compression. In addition, we characterize the quality of the proposed protocol by introducing a novel criterion that requires the communication to be bounded by the loss suffered.},
keywords = {communication-efficient, distributed, dynamic averaging, federated learning, kernel methods, parallelization},
pubstate = {published},
tppubtype = {inproceedings}
}
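A sketch of why communication becomes the bottleneck for kernelized learners: the average of models given as support vector expansions is the concatenation of all expansions with rescaled coefficients, so the model grows with every synchronization unless it is compressed. The (alpha, x) pair representation and the kernel argument are assumptions for illustration.

```python
def average_expansions(expansions):
    """Average of kernel models f_i(x) = sum_j alpha_ij * k(x_ij, x): concatenate all
    expansions and scale each coefficient by 1/m."""
    m = len(expansions)
    return [(alpha / m, x) for expansion in expansions for (alpha, x) in expansion]

def predict(expansion, x, kernel):
    """Evaluate a support vector expansion at a point x."""
    return sum(alpha * kernel(x_j, x) for alpha, x_j in expansion)
```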
Ullrich, Katrin; Kamp, Michael; Gärtner, Thomas; Vogt, Martin; Wrobel, Stefan
Ligand-based virtual screening with co-regularised support Vector Regression Proceedings Article
In: 2016 IEEE 16th international conference on data mining workshops (ICDMW), pp. 261–268, IEEE 2016.
Abstract | Links | BibTeX | Tags: biology, chemistry, corresponding projections, semi-supervised
@inproceedings{ullrich2016ligand,
title = {Ligand-based virtual screening with co-regularised support Vector Regression},
author = {Katrin Ullrich and Michael Kamp and Thomas Gärtner and Martin Vogt and Stefan Wrobel},
url = {http://michaelkamp.org/wp-content/uploads/2020/03/LigandBasedCoSVR.pdf},
year = {2016},
date = {2016-01-01},
urldate = {2016-01-01},
booktitle = {2016 IEEE 16th international conference on data mining workshops (ICDMW)},
pages = {261--268},
organization = {IEEE},
abstract = {We consider the problem of ligand affinity prediction as a regression task, typically with few labelled examples, many unlabelled instances, and multiple views on the data. In chemoinformatics, the prediction of binding affinities for protein ligands is an important but also challenging task. As protein-ligand bonds trigger biochemical reactions, their characterisation is a crucial step in the process of drug discovery and design. However, the practical determination of ligand affinities is very expensive, whereas unlabelled compounds are available in abundance. Additionally, many different vectorial representations for compounds (molecular fingerprints) exist that cover different sets of features. To this task we propose to apply a co-regularisation approach, which extracts information from unlabelled examples by ensuring that individual models trained on different fingerprints make similar predictions. We extend support vector regression similarly to the existing co-regularised least squares regression (CoRLSR) and obtain a co-regularised support vector regression (CoSVR). We empirically evaluate the performance of CoSVR on various protein-ligand datasets. We show that CoSVR outperforms CoRLSR as well as existing state-of-the-art approaches that do not take unlabelled molecules into account. Additionally, we provide a theoretical bound on the Rademacher complexity for CoSVR.},
keywords = {biology, chemistry, corresponding projections, semi-supervised},
pubstate = {published},
tppubtype = {inproceedings}
}
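A least-squares stand-in for the co-regularisation objective described above (the paper uses the epsilon-insensitive SVR loss and kernels; this linear, squared-loss sketch only shows how the agreement term on unlabelled data enters the gradients). All parameter names are illustrative.

```python
import numpy as np

def co_regularised_fit(X1, X2, y, U1, U2, lam=1.0, nu=1.0, lr=1e-2, steps=2000):
    """Two-view co-regularisation sketch: each view fits the labels while the two
    predictors are pushed to agree on the unlabelled views U1/U2."""
    rng = np.random.default_rng(0)
    w1 = rng.normal(scale=0.01, size=X1.shape[1])
    w2 = rng.normal(scale=0.01, size=X2.shape[1])
    n, m = len(y), len(U1)
    for _ in range(steps):
        r1, r2 = X1 @ w1 - y, X2 @ w2 - y          # labelled residuals per view
        d = U1 @ w1 - U2 @ w2                      # disagreement on unlabelled data
        g1 = X1.T @ r1 / n + lam * U1.T @ d / m + nu * w1
        g2 = X2.T @ r2 / n - lam * U2.T @ d / m + nu * w2
        w1 -= lr * g1
        w2 -= lr * g2
    return w1, w2
```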
2015
Kamp, Michael; Boley, Mario; Gärtner, Thomas
Parallelizing Randomized Convex Optimization Workshop
Proceedings of the 8th NIPS Workshop on Optimization for Machine Learning, 2015.
@workshop{kamp2015parallelizing,
title = {Parallelizing Randomized Convex Optimization},
author = {Michael Kamp and Mario Boley and Thomas Gärtner},
url = {http://www.opt-ml.org/papers/OPT2015_paper_23.pdf},
year = {2015},
date = {2015-01-01},
urldate = {2015-01-01},
booktitle = {Proceedings of the 8th NIPS Workshop on Optimization for Machine Learning},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
2014
Kamp, Michael; Boley, Mario; Gärtner, Thomas
Beating Human Analysts in Nowcasting Corporate Earnings by using Publicly Available Stock Price and Correlation Features Proceedings Article
In: Proceedings of the SIAM International Conference on Data Mining, pp. 641–649, SIAM 2014.
@inproceedings{michael2014beating,
title = {Beating Human Analysts in Nowcasting Corporate Earnings by using Publicly Available Stock Price and Correlation Features},
author = {Michael Kamp and Mario Boley and Thomas Gärtner},
url = {http://www.ferari-project.eu/wp-content/uploads/2014/12/earningsPrediction.pdf},
year = {2014},
date = {2014-01-01},
urldate = {2014-01-01},
booktitle = {Proceedings of the SIAM International Conference on Data Mining},
volume = {72},
pages = {641--649},
organization = {SIAM},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Kamp, Michael; Boley, Mario; Keren, Daniel; Schuster, Assaf; Sharfman, Izchak
Communication-Efficient Distributed Online Prediction by Dynamic Model Synchronization Proceedings Article
In: European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECMLPKDD), Springer 2014.
BibTeX | Tags:
@inproceedings{kamp2014communication,
title = {Communication-Efficient Distributed Online Prediction by Dynamic Model Synchronization},
author = {Michael Kamp and Mario Boley and Daniel Keren and Assaf Schuster and Izchak Sharfman},
year = {2014},
date = {2014-01-01},
urldate = {2014-01-01},
booktitle = {European Conference on Machine Learning and Principles and Practice of Knowledge Discovery (ECMLPKDD)},
organization = {Springer},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Kamp, Michael; Boley, Mario; Mock, Michael; Keren, Daniel; Schuster, Assaf; Sharfman, Izchak
Adaptive Communication Bounds for Distributed Online Learning Workshop
Proceedings of the 7th NIPS Workshop on Optimization for Machine Learning, 2014.
@workshop{kamp2014adaptive,
title = {Adaptive Communication Bounds for Distributed Online Learning},
author = {Michael Kamp and Mario Boley and Michael Mock and Daniel Keren and Assaf Schuster and Izchak Sharfman},
url = {https://arxiv.org/abs/1911.12896},
year = {2014},
date = {2014-01-01},
urldate = {2014-01-01},
booktitle = {Proceedings of the 7th NIPS Workshop on Optimization for Machine Learning},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
2013
Kamp, Michael; Kopp, Christine; Mock, Michael; Boley, Mario; May, Michael
Privacy-preserving mobility monitoring using sketches of stationary sensor readings Proceedings Article
In: Joint European Conference on Machine Learning and Knowledge Discovery in Databases, pp. 370–386, Springer 2013.
BibTeX | Tags:
@inproceedings{kamp2013privacy,
title = {Privacy-preserving mobility monitoring using sketches of stationary sensor readings},
author = {Michael Kamp and Christine Kopp and Michael Mock and Mario Boley and Michael May},
year = {2013},
date = {2013-01-01},
urldate = {2013-01-01},
booktitle = {Joint European Conference on Machine Learning and Knowledge Discovery in Databases},
pages = {370--386},
organization = {Springer},
keywords = {},
pubstate = {published},
tppubtype = {inproceedings}
}
Kamp, Michael; Boley, Mario; Gärtner, Thomas
Beating Human Analysts in Nowcasting Corporate Earnings by Using Publicly Available Stock Price and Correlation Features Workshop
2013 IEEE 13th International Conference on Data Mining Workshops, IEEE 2013.
BibTeX | Tags:
@workshop{kamp2013beating,
title = {Beating Human Analysts in Nowcasting Corporate Earnings by Using Publicly Available Stock Price and Correlation Features},
author = {Michael Kamp and Mario Boley and Thomas Gärtner},
year = {2013},
date = {2013-01-01},
urldate = {2013-01-01},
booktitle = {2013 IEEE 13th International Conference on Data Mining Workshops},
pages = {384--390},
organization = {IEEE},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
Boley, Mario; Kamp, Michael; Keren, Daniel; Schuster, Assaf; Sharfman, Izchak
Communication-Efficient Distributed Online Prediction using Dynamic Model Synchronizations. Workshop
First International Workshop on Big Dynamic Distributed Data (BD3) at the International Conference on Very Large Data Bases (VLDB), 2013.
BibTeX | Tags:
@workshop{boley2013communication,
title = {Communication-Efficient Distributed Online Prediction using Dynamic Model Synchronizations.},
author = {Mario Boley and Michael Kamp and Daniel Keren and Assaf Schuster and Izchak Sharfman},
year = {2013},
date = {2013-01-01},
urldate = {2013-01-01},
booktitle = {First International Workshop on Big Dynamic Distributed Data (BD3) at the International Conference on Very Large Data Bases (VLDB)},
pages = {13--18},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}
Kamp, Michael; Manea, Andrei
STONES: Stochastic Technique for Generating Songs Workshop
Proceedings of the NIPS Workshop on Constructive Machine Learning (CML), 2013.
BibTeX | Tags:
@workshop{kamp2013stones,
title = {STONES: Stochastic Technique for Generating Songs},
author = {Michael Kamp and Andrei Manea},
year = {2013},
date = {2013-01-01},
urldate = {2013-01-01},
booktitle = {Proceedings of the NIPS Workshop on Constructive Machine Learning (CML)},
keywords = {},
pubstate = {published},
tppubtype = {workshop}
}