FilterHN

DeepSeekMath-V2: Towards Self-Verifiable Mathematical Reasoning [pdf]

29 points

by fspeech

1 hour ago

| past

| 4 comments

| github.com

| HN

▲

awei

48 minutes ago

[-]

Something weird here, why is it so hard to have a deterministic program capable of checking a proof or anything math related, aren't maths super deterministic when natural language is not. From first principles, it should be possible to do this without a llm verifier.

▲

JacobiX

19 minutes ago

[-]

I think that mathematical proofs, as they are actually written, rely on natural language and on a large amount of implicit shared knowledge. They are not formalized in the Principia Mathematica sense, and they are even further from the syntax required by modern theorem provers. Even the most rigorous proofs such as those in Bourbaki are not directly translatable into a fully formal system.

▲

xemdetia

14 minutes ago

[-]

Maths can be super deterministic but often difficult to compute because of concepts like inferring by induction. I had to personally unlearn and rebase my understanding of math based in computation to 'get' pure maths. Another example is set building. You often don't need to compute the existence of members of sets in pure math you just need to agree that there are some members of a set that meet the criteria. How many or how many things that aren't in the set aren't meaningful often times to accept something and move on with the proof. From the computing perspective this can be difficult to put together.

▲

jebarker

21 minutes ago

[-]

I haven’t read the paper yet, but I’d imagine the issue is converting the natural language generated by the reasoner into a form where a formal verifier can be applied.

▲

riku_iki

25 minutes ago

[-]

such high performance program indeed could potentially be superior, if it would exist (this area is very undeveloped, there is no existing distributed well established solution which could handle large domain) and math would be formalized in that program's dsl, which also didn't happen yet.

▲

zaxioms

53 minutes ago

[-]

It's cool, but I genuinely cannot fathom why they are targeting natural language proofs instead of a proof assistant.

▲

mamami

31 minutes ago

[-]

Natural language is a lot more, well, readable than say lean. You get a lot less intuition and understanding of what the model is attempting to do in the first place.

▲

agentultra

36 minutes ago

[-]

So it's designed for informal proofs and it "verifies" based on a rubric fitting function and human interaction, is that right?

What's the use case for a system like this?

▲

photon_lines

1 hour ago

[-]

Exciting stuff from a fantastic team.