Tests

Note: these tests are running on a shared server, so that times may vary between runs, and they are substantially [an order of magnitude] slower than on a comparable dedicated machine.

In some cases we have not sought to supply the shortest proof (from fewest premises), but have tested the prover's 'discernment' by supplying unneeded premises.

Note: A few of these tests are deliberate disproofs, as a check on soundness. These tests are positively proved invalid and are noted as such when they are run.

Tau accepts input in KIF (Knowledge Interchange Format), a first-order notation in ASCII. Documentation for KIF is found here:

See also a list of various axioms which may be pasted into the prover. Note that some proofs in Peano arithmetic and Presburger arithmetic require that Tau apply mathematical induction (see the checkbox) or use previously proved results, while others do not. We leave it as an exercise for the reader to determine which. See the Wikipedia articles, Peano arithmetic and Presburger arithmetic.


Montague, Richard and Donald Kalish
Logic, Techniques of Formal Reasoning
New York, Harcourt, Brace and World, Inc., 1964.

T247
T248
T253
T254
T255
T256
T258
T259
T260
T261
T263
T264
T265
T266
T267
T268
T269
T270
T271
T272
T273
T301
T302
T303
T304
T305
T306
T307
T308
T309
T310
T311
T312
T313
T314
T315
T316
T317
T318
T321
T322
T323
T324
T325
T326
T327
T319



Montague, Richard , Kalish, Donald, and Mar, Gary (Ed. Robert Fogelin)
Logic: Techniques of Formal Reasoning
Harcourt Brace, New York, 1980

T2-239
T2-240
T2-241
T2-247
T2-248
T2-249
T2-250
T2-251
T2-252
T2-253
T2-256
T2-259
T2-261
T2-263
T2-264
T2-272
T2-312
T2-314
T2-315
T2-320
T2-321
T2-322
T2-324
T2-325
T2-326
T2-327
T2-328
T2-331
T2-332
T2-335
T2-323
       

Thomas, James A.
Symbolic Logic
Charles E. Merrill Publishing Co., Columbus, Ohio, 1977

V.5E11
V.5E12
V.5Ex1
V.7E17
V.7E18
V.7E19
Th262-9
Th266-b
V.5E10

Simple identity tests

ID001
ID002
ID003
ID004
ID005
ID006
ID007
T001
T002
T003
T004
T005
T006


Miscellaneous tests

J001
J002
J003
J004
J005
J006
J007
J008
J009
J011
J015
J016
J017
J018
J019
J020
Voters
J266
TrapIA
NRV716
In001
Los001
The Head of a Horse
Donkey
CP
simpletest
Theorem Sample 1
Theorem Sample 2
Theorem Sample 3
Argument Sample 1
Argument Sample 2
Argument Sample 3
JL

Tests adapted from Enderton, Herbert B.
A Mathematical Introduction to Logic, 2nd Ed.
Harcourt Academic Press, New York, 2001

Simple Peano Arithmetic tests (no induction used)

Peano axioms:

(forall ?X (not (= 0 (succ ?X))))
(forall ?X (forall ?Y (=> (= (succ ?X) (succ ?Y)) (= ?X ?Y))))
(forall ?X (= (+ ?X 0) ?X))
(forall ?X (forall ?Y (= (+ ?X (succ ?Y)) (succ (+ ?X ?Y)))))
(forall ?X (forall ?Y (= (* 0 ?X) 0)))
(forall ?X (forall ?Y (= (* ?X (succ ?Y)) (+ (* ?X ?Y) ?X))))

PA02V
 PA03V
PA007
PA04V
PA05V
PA06V
CMMult
PA11
Sch11.16
Sch11.15
PA001
PA002
PA003
PA004
PA005
PA101
PA102
PA_JH02a
PA_JH02b
PA_JH03
PA_JH04
PA_JH05
PA_JH06

Peano Arithmetic (PA, with mathematical induction)

Induction (all instances of):

(=> (and (F 0) (forall ?X (=> (F ?X) (F (succ ?X)))) (forall ?X (F ?X))))

Commutativity axioms (proved with induction):

(forall ?X (forall ?Y (= (+ ?X ?Y) (+ ?Y ?X))))
(forall ?X (forall ?Y (= (* ?X ?Y) (* ?Y ?X))))

Sch11.19Ind

Sch11.18Ind
CMMultInd
Sch11.17Ind
PA12Ind
PA16Ind
PA17Ind
PAThm05Ind
PA18Ind
PA19Ind
PAThm02Ind
PAThm04Ind
PAThm06Ind

 

PA + < (PALT)

PALT01Ind
PALT02Ind

PA + Odd/Even (PAOE)

PAOE01
PAOE02
PAOE03Ind
PAOE04Ind
PAOE05
PAOE06
PAOE07Ind
PAOE08

 

Presburger Arithmetic (PrA)

(forall ?X (not (= 0 (succ ?X))))
(forall ?X (forall ?Y (=> (= (succ ?X) (succ ?Y)) (= ?X ?Y))))
(forall ?X (= (+ ?X 0) ?X))
(forall ?X (forall ?Y (= (+ ?X (succ ?Y)) (succ (+ ?X ?Y)))))

Induction (all instances of):

(=> (and (F 0) (forall ?X (=> (F ?X) (F (succ ?X)))) (forall ?X (F ?X)))) , where F represents any formula with one free variable.

AssocAdd
CMAdd
PrA01Ind
PrA02Ind
PrA03Ind
PrA04Ind
PrA05Ind
PrA07Ind
PrA08Ind
PrA12
PrA18Ind
PrA19Ind
PrA20Ind
PrA24
PrA25Ind
PrA44
PrA54Ind
PrA55Ind
PrA56
PrA59Ind

PrA + < (PrALT)

PrALEabc
PrALT01
PrALT02
PrALTabc
PrALT03
PrALT04
PrALT05
PrALT06
PrALT07
PrALT08Ind
PrALTabc
PrALT19
PrALT20Ind
PrALT21Ind
PrALT22Ind
PrALEabc
PrALT18j
PrALT18b
PrALT18l
PrALT18d
PrALT18e
PrALT18
PrALT18i
PrALT18h


PrA + Odd/Even (PrAOE)

PrAOE01
PrAOE02
PrAOE03
PrAOE04
PrAOE05
PrAOE06
PrAOE07
PrAOE08
PrAOE10
PrAOE12
PrAOE14Ind
PrAOE16Ind
PrAOE21
PrAOE21Ind
PrAOE32

Theory of successor (Succ)

Succ01Ind
Succ02



Tests from Nolt, J. and D. Rohatyn, A. Varzi
Schaum's Outline of Logic
McGraw-Hill, 1998

Graph theory tests

06a Enumerated

06a Ground

06b Enumerated

06b Ground

06c Generalized

06c Ground

06d Enumerated

06d Ground

07a

07b

07c

07d

08a

08b Enumerated

08b Ground

08c

08d
09a
09b

09c

09d

10a

10b

10c

10d

Commutative ordered field tests

From Kalish & Montague, 1st. ed. Chapter VIII page 280
See also Field Axioms

COF_T22
COF_T05
COF_T34
COF_T09
COF_T10
COF_T14
COF_T18

TPTP tests

TPTP-NUM015
TPTP-NUM016

Disproofs

KM158_6
KM158_7
KM160_12
Th266-c
Th267-f
Th267-g
Th267-gA
Th267-l
J010
J021
J022
J024
NT253


Unresolved proof attempts (invalid, but unable to disprove directly)

T2-320d (KM&M)
T2-325d (KM&M)
U001
U002
U002a

Hard tests (currently unprovable within Tau's normal resource limits)

T329 (K&M)

T328 (K&M)

T320 (K&M)

Do you have a good problem for us to test? Submit it here, in KIF or ?