Reply to: Deep reinforced learning heuristic tested on spin-glass ground states: The larger picture

Fan, Changjun; Shen, Mutian; Nussinov, Zohar; Liu, Zhong; Sun, Yizhou; Liu, Yang-Yu

doi:10.1038/s41467-023-41108-w

Download PDF

Matters Arising
Open access
Published: 14 September 2023

Reply to: Deep reinforced learning heuristic tested on spin-glass ground states: The larger picture

Changjun Fan¹^na1,
Mutian Shen²^na1,
Zohar Nussinov^2,3,
Zhong Liu¹,
Yizhou Sun ORCID: orcid.org/0000-0003-1812-6843⁴ &
…
Yang-Yu Liu ORCID: orcid.org/0000-0003-2728-4907^5,6

Nature Communications volume 14, Article number: 5659 (2023) Cite this article

1426 Accesses
1 Altmetric
Metrics details

Subjects

The Original Article was published on 14 September 2023

replying to S. Boettcher Nature Communications https://doi.org/10.1038/s41467-023-41106-y (2023)

Here we provide a comprehensive response to the Comment written by Stefan Boettcher. We argue that the Comment did not account for the fairness of the comparison between different methods in searching for the spin-glass ground states. We demonstrate that, with a reasonably larger number of initial spin configurations, our results agree with the asymptotic scaling form assumed by finite-size corrections.

3D Edwards-Anderson (EA) model

In Fig. 5 of our paper¹, we plotted the disorder-averaged energy per spin (denote as e₀) as a function of the number of initial spin configurations (denoted as n_initial) for different methods to benchmark those methods on large 3D EA Ising spin glass instances with Gaussian disorder. The Comment pointed out that DIRAC-SA (a variant of our DIRAC method) did not reach the ground states for those systems, as indicated by the large deviation of the three red points from the asymptotic scaling form assumed by finite-size corrections (FSC), see Fig. 1 of the Comment and this response letter. However, as we explicitly mentioned in the caption of Fig. 5 in our paper¹, we only ran all the tested algorithms up to a small n_initial = 2.0 × 10⁴, which is much smaller than the number required to reach the ground state, as reported in the literature. For instance, Ref. ² reported that, to reach the ground state for 3D, L = 10 systems, the parallel tempering (PT) method requires n_initial = 3.2 × 10⁷, which is 1600 times larger than the number of initial spin configurations we used. Such a big difference in terms of n_initial is certainly not inconsequential. We did not expect any of the methods to reach the ground state with n_initial = 2.0 × 10⁴ for large 3D EA instances with Gaussian disorder. Indeed, for 3D, L = 10 systems, with n_initial = 2.0 × 10⁴, PT and simulated annealing (SA) did not reach the expected ground state either (see the magenta and cyan points in Fig. 1 of this response). In fact, with the same n_initial, results of these two methods are even farther away from the FSC line than DIRAC-SA for 3D, L = 10 systems (see the third red point in Fig. 1 of this response). Without specifying the number of initial spin configurations, we think it is unfair and meaningless to compare different methods in searching for the ground states of large spin-glass instances.

**Fig. 1: With a reasonably large n_initial, our DIRAC-SA results agree well with larger picture suggested by FSC.**

In our paper¹ we did not try a larger n_initial for two reasons. First, we had already demonstrated the ability of DIRAC to reach the exact ground states for small systems (which can be confirmed by the branch-and-bound-based solver Gurobi), as shown in Fig. 4 of our paper¹. Second, we did not find it necessary to invest extensive computational resources in an “arms race” fashion of computing the “ground states” of these large systems for which exact solvers cannot confirm the results. Also, to achieve the (true) ground states the required n_initial may be exponential in the system size. There is no exception for DIRAC or any other heuristic methods. Our paper aimed to demonstrate the effectiveness and efficiency of DIRAC over other methods at the same n_initial, rather than to confirm the asymptotic scaling form assumed by FSC. We appreciate the “larger picture” mentioned in the Comment. But it was beyond the scope of our paper.

Since the Comment questioned the ability of our method to reach the ground state for large systems, we think it is necessary to perform heavier computations with a larger n_initial to directly address the Comment. For 3D, L = 10 systems with n = 50 instances, we found that, with n_initial = 6.5 × 10⁵, about 2% as that needed for PT, the average energy per spin computed by DIRAC-SA could indeed reach the asymptotic scaling form assumed by FSC (see the leftmost green point in Fig. 1). We also plotted e₀ computed by DIRAC-SA for 3D, L = 4, 5, 6, 7, 8, with n = 850, 900, 820, 120, 221 instances respectively, in the same figure. We found that they agree well with the FSC line. These results clearly demonstrate that the importance of using a large n_initial to achieve results consistent with the prediction of FSC. We are grateful that the Comment helped us clarify this point. As mentioned above, confirming the asymptotic scaling form assumed by FSC was not the original goal of our paper.

Sherrington-Kirkpatrick (SK) model

Fig. 2 of the Comment acknowledged that our results for the SK model are consistent with the asymptotic scaling form assumed by FSC, although in the figure we could still see a deviation from the FSC line for SK model of N = 64. We believe this deviation is simply due to the small number of instances (n = 50) used in our calculation. We notice that with n = 50 instances the results offered by the extremal optimization (EO) heuristic also deviate from the FSC line, especially for N = 125. We argue that DIRAC needs more instances to reach the FSC line, just like the EO case. After all, only the average over many different instances may be expected to behave as a smooth function of N³.

The Comment also pointed out that the system sizes we considered are relatively small. We emphasize that, as a reinforcement-learning framework based on graph neural network, DIRAC was not specifically designed for SK models with a complete graph topology. We believe that, to compute ground states for larger SK instances, DIRAC would have to be modified to explicitly consider the complete graph topology. However, this was beyond the scope of our paper.

Competitive methods

It is a pity that in our paper we did not explicitly cite any papers on the genetic algorithm^3,4 (GA) or extremal optimization (EO) heuristic^5,6,7. We did cite a book⁸ on the use of those heuristic methods for computing the spin-glass ground state though, as also pointed out by the Comment. In our paper, we did not compare the performance of DIRAC with that of GA and EO either. This is mainly because PT and GA were commonly used to compute the ground state of the EA Ising spin glass model with Gaussian disorder^2,9, and Ref. ⁹ reported that a simple PT algorithm performs as well as GA found in the literature. Hence, we chose PT as a competitive method of DIRAC. We did consider two classical heuristic methods: SA and Greedy algorithm. Overall, we think comparing DIRAC with those methods is sufficient to demonstrate its superiority.

Running time

In our main text, we primarily focused on comparing the value of n_initial among different methods. We believe this is a fair comparison since this metric remains unaffected by the computational environment, programming language, or system load during testing. It can also be interpreted as the number of ‘exploration steps’ taken by each algorithm, which, to some extent, reflects the algorithm’s level of ‘intelligence’. As an extreme example, Fig. 7 in our main text demonstrates that even a simple DIRAC¹ method can achieve the ground state of an anti-ferromagnetic model with the theoretically minimal number of exploration steps.

Nevertheless, we understand that some readers may inquire about the actual running time or ‘wall clock time’ of our algorithm. Therefore, we have provided two tables, Tables 1 and 2, which present the typical running times of DIRAC and SA on a laptop equipped with an Intel(R) Core(TM) i5-10400 processor and Nvidia(R) Geforce(R) RTX 2070 graphics card, and also a server equipped with an Intel(R) Xeon(R) Gold 6278C processor and Nvidia(R) Tesla(R) V100 graphic card. The running times of other algorithms, such as DIRAC-SA, DIRAC-PT or PT, can be roughly estimated based on these values. For instance, for n_initial = 5000, the time cost of DIRAC-SA is roughly the sum of 2500 DIRAC¹ and 2500 SA sweeps. Also, it is expected that the time required for an SA sweep and a PT sweep would not exhibit a significant difference.

Table 1 Average running time for n_initial = 1 on Intel(R) Core(TM) i5-10400 @2.9GHz and Nvidia(R) Geforce(R) RTX 2070

Full size table

Table 2 Average running time for n_initial = 1 on Intel(R) Xeon(R) Gold 6278C CPU @ 2.60GHz and Nvidia(R) Tesla(R) V100

Full size table

We acknowledge that our DIRAC code was not optimized for achieving the shortest running time. However, even in such case, in terms of the running time taken to reach the same energy, DIRAC’s running time is not at a disadvantage, if not in an advantageous position. For example, a comparative test was conducted on the same 3D, L = 10 systems for SA and DIRAC-SA. An average energy density of approximately −1.6956 can be achieved with 10⁴ SA (with n_initial = 5 × 10⁷), while reaching the same energy level only requires 47 DIRAC-SA (with n_initial = 2.35 × 10⁵). Even after taking into account the running time differences between DIRAC¹ and SA sweep shown in Table 2, we can estimate that the MATLAB version of DIRAC-SA is still ~2.5 times faster than the C++ version of SA. Despite the additional use of GPU, we believe that compared to SA, DIRAC can more naturally benefit from GPU acceleration, as the time consumption of DIRAC is primarily on matrix multiplication.

The running time of DIRAC is influenced by many factors, so there may still be room for improvement. In fact, during the development of DIRAC, we discovered a significant time overhead due to communication between C++ modules, the Tensorflow session, and the Python code. (As an indirect evidence, it can be observed that for this code, there is no significant difference in the running time between the RTX 2070 and V100 GPU). Hence, employing a unified programming language could greatly improve performance, as demonstrated by the MATLAB running times listed in Tables 1 and 2. In addition to these findings, we have identified several other ways to accelerate the code:

Implement the code in an incremental way. For instance, in the context of SA, when attempting to flip a spin, it is sufficient to compute the energy of that specific spin. However, in the current version of the DIRAC code, whenever the spin configuration is altered, all the Q values need to be recomputed, which is clearly not efficient. To improve this, we can modify the code to update only the affected Q values when a spin is flipped, rather than recomputing all of them. This incremental approach will optimize the computation process.
Matrix chain multiplication. In the current version of the DIRAC code, we did not optimize the order of the matrix multiplication. This could also possibly be a way to optimize the computation running time.
Programming language. We believe that if the entire code is written in C++/CUDA, the running time should be further reduced.

On the other hand, for the DIRAC¹ code written in MATLAB, the performance difference of GPUs is still very noticeable, compared to the insignificant differences in single-core performance among modern CPUs; for instance, see the SA sweep running time on different machines. For instance, when we replaced the RTX 2070 with the V100 server GPU, the running time was reduced by nearly 2–4 times. Furthermore, from the table, we can observe that for the DIRAC¹ code written in MATLAB, its time complexity appears to be even less than linear. This may suggest that the performance of GPUs is not fully utilized, at least in smaller systems. In general, we believe that DIRAC has significant potential for further development in terms of computational time.

Methods

The hyperparameters used in the DIRAC-SA algorithm mentioned in this paper are the same as the default hyperparameters in the GitHub code¹. In addition, the MATLAB version of DIRAC¹ that we used for the running time test has also been updated on GitHub¹. The details of the computing environments have been provided in the section “Running Time”.

Data availability

The data used to reproduce the results in this paper are publicly available¹⁰.

Code availability

The source code of DIRAC (and its variants), as well as the two baseline methods, SA and PT, are publicly available¹⁰ or on GitHub (https://github.com/FFrankyy/DIRAC.git).

References

Fan, C. et al. Searching for spin glass ground states through deep reinforcement learning. Nat. Commun. 14, 725 (2023).
Article ADS CAS PubMed PubMed Central Google Scholar
Wang, W., Machta, J. & Katzgraber, H. G. Comparing monte carlo methods for finding ground states of ising spin glasses: population annealing, simulated annealing, and parallel tempering. Phys. Rev. E 92, 013303 (2015).
Article ADS MathSciNet Google Scholar
Pál, K. F. The ground state of the cubic spin glass with short-range interactions of Gaussian distribution. Phys. A: Stat. Mechanics Appl. 233, 60–66 (1996).
Article ADS Google Scholar
Pál, K. F. The ground state energy of the Edwards-Anderson Ising spin glass with a hybrid genetic algorithm. Phys. A: Stat. Mechanics Appl. 223, 283–292 (1996).
Article ADS Google Scholar
Boettcher, S. Extremal optimization for Sherrington-Kirkpatrick spin glasses. Eur. Phys. J. B: Condens. Matter Complex Syst. 46, 501–505 (2005).
Article CAS Google Scholar
Boettcher, S. & Percus, A. G. Optimization with extremal dynamics. Phys. Rev. Lett. 86, 5211–5214 (2001).
Article ADS CAS PubMed MATH Google Scholar
Middleton, A. A. Improved extremal optimization for the Ising spin glass. Phys. Rev. E 69, 055701 (2004).
Article ADS Google Scholar
Hartmann, A. K. & Rieger, H. New optimization algorithms in physics. (2004).
Romá, F., Risau-Gusman, S., Ramirez-Pastor, A. J., Nieto, F. & Vogel, E. E. The ground state energy of the Edwards–Anderson spin glass model with a parallel tempering monte carlo algorithm. Phys. A: Stat. Mechanics Appl. 388, 2821–2838 (2009).
Article ADS Google Scholar
Fan, C. et al. Searching for spin glass ground states through deep reinforcement learning. Zenodo https://doi.org/10.5281/zenodo.7562380 (2023).
Boettcher, S. Deep reinforced learning heuristic tested on spin-glass ground states: The larger picture. Nat. Commun. https://doi.org/10.1038/s41467-023-41106-y (2023).

Download references

Acknowledgements

We wish to thank Stefan Boettcher for discussions and correspondence. We are grateful to L. Zeng for the valuable discussions. C.F. and Z.L. are supported by the National Natural Science Foundation of China (NSFC, 62206303, 62273352, 62073333, 72025405, 72088101), and the science and technology innovation Program of Hunan Province (2023RC3009).

Author information

These authors contributed equally: Changjun Fan, Mutian Shen.

Authors and Affiliations

College of Systems Engineering, National University of Defense Technology, Changsha, 410073, China
Changjun Fan & Zhong Liu
Department of Physics, Washington University in St. Louis, Campus Box 1105, 1 Brookings Drive, St. Louis, MO, 63130, USA
Mutian Shen & Zohar Nussinov
Rudolf Peierls Centre for Theoretical Physics, University of Oxford, Oxford, OX1 3PU, UK
Zohar Nussinov
Department of Computer Science, University of California, Los Angeles, CA, 90024, USA
Yizhou Sun
Channing Division of Network Medicine, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA, 02115, USA
Yang-Yu Liu
Center for Artificial Intelligence and Modeling, The Carl R. Woese Institute for Genomic Biology, University of Illinois at Urbana-Champaign, Champaign, IL, 61820, USA
Yang-Yu Liu

Authors

Changjun Fan
View author publications
You can also search for this author in PubMed Google Scholar
Mutian Shen
View author publications
You can also search for this author in PubMed Google Scholar
Zohar Nussinov
View author publications
You can also search for this author in PubMed Google Scholar
Zhong Liu
View author publications
You can also search for this author in PubMed Google Scholar
Yizhou Sun
View author publications
You can also search for this author in PubMed Google Scholar
Yang-Yu Liu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Y.-Y.L. conceived and designed the project. Y.-Y.L. and Y.S. managed the project. C.F. and M.S. performed all the numerical calculations and analyzed the results, Y.-Y.L., Y.S., Z.N., and Z.L. interpreted the results. C.F., M.S., and Y.-Y.L. wrote the paper, Y.S. and Z.N. edited the paper.

Corresponding authors

Correspondence to Yizhou Sun or Yang-Yu Liu.

Ethics declarations

Competing interests

The authors declare no competing interests.

Peer review

Peer review information

Nature Communications thanks the anonymous, reviewer(s) for their contribution to the peer review of this work.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Fan, C., Shen, M., Nussinov, Z. et al. Reply to: Deep reinforced learning heuristic tested on spin-glass ground states: The larger picture. Nat Commun 14, 5659 (2023). https://doi.org/10.1038/s41467-023-41108-w

Download citation

Received: 12 May 2023
Accepted: 21 August 2023
Published: 14 September 2023
DOI: https://doi.org/10.1038/s41467-023-41108-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.