MARC View

000		00398nam a2200157Ia 4500
005		20250919153850.0
008		250729s9999 xx 000 0 und d
020		_a9780262193986
040		_cnmit
100		_aRichard S Sutton
245	0	_aReinforcement Learning _ban introduction
260		_aLondon _bMIT Press _c1998
300		_axviii, 322
700		_aAndrew G Bato
942		_cBK _2ddc
999		_c11983 _d11983