Learn how Moment-Sum-of-Squares relaxation improves optimization for machine learning models when standard SDP methods fail to find global optima.Learn how Moment-Sum-of-Squares relaxation improves optimization for machine learning models when standard SDP methods fail to find global optima.

Improving Global Optimization in HSVM and SDP Problems

2026/01/14 23:30
3 min read
For feedback or concerns regarding this content, please contact us at [email protected]

Abstract and 1. Introduction

  1. Related Works

  2. Convex Relaxation Techniques for Hyperbolic SVMs

    3.1 Preliminaries

    3.2 Original Formulation of the HSVM

    3.3 Semidefinite Formulation

    3.4 Moment-Sum-of-Squares Relaxation

  3. Experiments

    4.1 Synthetic Dataset

    4.2 Real Dataset

  4. Discussions, Acknowledgements, and References

    \

A. Proofs

B. Solution Extraction in Relaxed Formulation

C. On Moment Sum-of-Squares Relaxation Hierarchy

D. Platt Scaling [31]

E. Detailed Experimental Results

F. Robust Hyperbolic Support Vector Machine

3.4 Moment-Sum-of-Squares Relaxation

The SDP relaxation in Equation (8) may not be tight, particularly when the resulting W has a rank much larger than 1. Indeed, we often find W to be full-rank empirically. In such cases, moment-sum-of-squares relaxation may be beneficial. Specifically, it can certifiably find the global optima, provided that the solution exhibits a special structure, known as the flat-extension property [30, 32].

\

\ With all these definitions established, we can present the moment-sum-of-squares relaxation [9] to the HSVM problem, outlined in Equation (7), as

\

\ Note that 𝑔(q) ⩟ 0, as previously defined, serves as constraints in the original formulation. Additionally, when forming the moment matrix, the degree of generated monomials is 𝑠 = 𝜅 − 1, since all constraints in Equation (7) has maximum degree 1. Consequently, Equation (13) is a convex programming and can be implemented as a standard SDP problem using mainstream solvers. We further emphasize that by progressively increasing the relaxation order 𝜅, we can find increasingly better solutions theoretically, as suggested by Lasserre [33]

\

\ where đ” is an index set of the moment matrix to entries generated by w along, ensuring that each moment matrix with overlapping regions share the same values as required. We refer the last constraint as the sparse-binding constraint.

\ Unfortunately, our solution empirically does not satisfy the flat-extension property and we cannot not certify global optimality. Nonetheless, in practice, it achieves significant performance improvements in selected datasets over both projected gradient descent and the SDP-relaxed formulation. Similarly, this formulation does not directly yield decision boundaries and we defer discussions on the extraction methods to Appendix B.2.

\ Figure 2: Star-shaped Sparsity pattern in Equation (13) visualized with 𝑛 = 4

\

:::info Authors:

(1) Sheng Yang, John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA ([email protected]);

(2) Peihan Liu, John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA ([email protected]);

(3) Cengiz Pehlevan, John A. Paulson School of Engineering and Applied Sciences, Harvard University, Cambridge, MA, Center for Brain Science, Harvard University, Cambridge, MA, and Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Cambridge, MA ([email protected]).

:::


:::info This paper is available on arxiv under CC by-SA 4.0 Deed (Attribution-Sharealike 4.0 International) license.

:::

\

Market Opportunity
Brainedge Logo
Brainedge Price(LEARN)
$0.006864
$0.006864$0.006864
-2.31%
USD
Brainedge (LEARN) Live Price Chart
Disclaimer: The articles reposted on this site are sourced from public platforms and are provided for informational purposes only. They do not necessarily reflect the views of MEXC. All rights remain with the original authors. If you believe any content infringes on third-party rights, please contact [email protected] for removal. MEXC makes no guarantees regarding the accuracy, completeness, or timeliness of the content and is not responsible for any actions taken based on the information provided. The content does not constitute financial, legal, or other professional advice, nor should it be considered a recommendation or endorsement by MEXC.

You May Also Like

Why African countries are using data protection laws as backdoor to regulate AI

Why African countries are using data protection laws as backdoor to regulate AI

Rather than waiting for comprehensive AI frameworks, which are often complex and slow to develop, governments across the continent are embedding AI-related rules
Share
Techcabal2026/03/19 18:46
YieldMax Funds Explained: How These ETFs Work, What They Pay & The Hidden Risks

YieldMax Funds Explained: How These ETFs Work, What They Pay & The Hidden Risks

If you have spent any time in income-investing circles recently, you have almost certainly come across YieldMax funds the ETFs promising yields of 30%, 50%, or
Share
Fintechzoom2026/03/19 18:14
Aster Price Surges After Airdrop and CZ Mention

Aster Price Surges After Airdrop and CZ Mention

The post Aster Price Surges After Airdrop and CZ Mention appeared on BitcoinEthereumNews.com. Aster, previously referred to as APX, witnessed its token price soar on September 18, rising by over 360% in one day. The surge followed after the project started its airdrop program and from CZ. What’s Driving Aster Price Surge The token’s steep price action came after the token’s airdrop began, and it will run until October 17. Approximately 704 million tokens representing approximately 8.8% of the total supply are being sent to eligible users. These include members of Aster’s Spectra Stage 0 and 1 programs, owners of Aster Gems, and traders of Aster Pro. Adding fuel to the charge, CZ publicly congratulated the Aster team, further increasing visibility to the project. That validation, combined with the token distribution, driven the price surge. Fundamentals Behind the Rally Beyond the frenzy, Aster’s fundamentals have been improving. Based on statistics provided by DeFi Llama. Its perpetual futures platform has seen more than $12 billion worth of trading volume this month, an increase from $9.78 billion in August and $8.5 billion last July. Revenue has increased steeply as well. Fees earned this quarter total $8.82 million, up from only $1.8 million during the same time last year. In Q3 2024, Aster had only generated $11,660 in revenue, but today that number is up to $5.4 million. The total value locked (TVL) in the protocol has hit a record high of $1.85 billion, an astronomical increase from $141 million in January. What’s Next for Aster Analysts believe that the rally may prevail since Aster is now becoming available on additional exchanges, yet it is mainly traded on its own platform. Yet with recipients of the airdrop likely to take profits in place, there will be some pressure selling. Like other recently listed coins like WLFI, Spark, and Avantis, a good starting run will be followed

Share
BitcoinEthereumNews2025/09/19 08:30