tensorflow-haskell

mirror of https://github.com/tensorflow/haskell.git synced 2025-04-28 08:45:24 +02:00

Author	SHA1	Message	Date
fkm3	baa501b262	Use newer version of stack in CI (#189 ) Required by #187. The version we were using is old enough that it doesn't work with the latest stackage LTS. haskellstack.org says There is also a Ubuntu package for Ubuntu 16.10 and up, but the distribution's Stack version lags behind, ... So, instead of asking them to update it, it's probably better to download the tar of the version we want. Somehow updating stack surfaced a new pedantic warning in GradientTest, so I've fixed that as well.	2018-05-15 23:19:15 -04:00
fkm3	1e2dca8701	Update to tensorflow 1.7 (#185 ) All of the non-s/1.3/1.7/ changes are because * There are new tensorflow datatypes * Some ops have looser types (e.g. fill now accepts both int64 and int32) * There are more ops of type "func"	2018-04-17 12:24:31 -04:00
fkm3	e35211d49b	Fix initialized variables for tensorflow 1.7 (#184 ) * Fix initialized variables for tensorflow 1.7 This is needed to support tensorflow 1.7. The trick of initializing a variable with `Shape []` and then overriding the shape by assigning an initial value no longer works. It seems that we need to explicitly flip the unknown_rank bit in the shape proto. I thought about switching opgen to use `Maybe Shape` when an op requires a shape attribute, but that will cause a lot of api churn, so I chose to hold off for now and just do a spot fix to unblock 1.7.	2018-04-16 07:48:05 -07:00
Christian Berentsen	2dcc921f6e	Gradient of Conv2DBackpropInput (#155 )	2017-10-15 11:49:44 -07:00
Nathan T.A. Lewis	d79a919efa	Fix a small typo in the warning message of numOutputs (#151 )	2017-08-24 17:34:22 -04:00
Jonathan Kochems	79d8d7edea	Adding gradient for Concat (#144 )	2017-07-29 23:29:33 -04:00
fkm3	cac45d1cd6	Delete inaccurate comments	2017-07-25 09:38:00 -04:00
Christian Berentsen	bebc4aa7d9	Add gradient of 'maximum' and 'gradForBinaryCwise' `maximum` gradient uses `gradForBinaryCwise` which may be useful for other binary componentwise op gradients	2017-07-25 00:14:23 -04:00
Christian Berentsen	ea30577264	Gradient for AddN	2017-07-25 00:06:10 -04:00
Christian Berentsen	4ab9cb9cf2	Moved reduceMean to Ops (#136 )	2017-06-20 20:50:46 -07:00
fkm3	0603a6987b	Add Minimize module with gradient descent and adam implementations (#125 )	2017-05-25 19:19:22 -07:00
fkm3	a86d424cac	Add initializedValue function for Variable (#124 )	2017-05-20 21:42:45 -07:00
fkm3	b86945f008	Support Variable in TensorFlow.Gradient and use in mnist example (#116 )	2017-05-17 13:20:51 -07:00
fkm3	ddb4fe4f90	Add ToTensor class	2017-05-16 23:05:33 -07:00
fkm3	0f04e5a50d	Expand Rendered class to support ResourceHandle wrappers like Variable This allows functions like `feed`, `colocateWith`, and (in a later commit) `gradients` to work with `Variable`.	2017-05-15 19:55:34 -07:00
Judah Jacobson	64971c876a	Consolidate some packages. (#111 ) - Merge tensorflow-nn and tensorflow-queue into tensorflow-ops. They don't add extra dependencies and each contain a single module, so I don't think it's worth separating them at the package level. - Remove google-shim in favor of direct use of test-framework.	2017-05-10 15:26:03 -07:00
Judah Jacobson	0fa719b701	Fix .cabal files so 'stack check' passes. (#110 ) - Add LICENSE files for all packages. - Add descriptions for packages that were missing one. - Work around google/proto-lens#69 by symlinking third_party into tensorflow-proto.	2017-05-10 11:37:00 -07:00
Judah Jacobson	ff5f1cccf4	Increase the number of iterations for MatrixTest. (#107 ) The number of iterations was reduced from 1000 to 300 during review, but that turned out to be too low and the test now fails about 20% of the time. After changing it back to 1000, the test succeeded at 50 out of 50 runs.	2017-05-09 09:54:09 -07:00
Judah Jacobson	a64af5076a	Work around #92 by always copying TensorData when fetching. It would be better to avoid the copy when it's not necessary, but that will require more involved changes to the internal API. (For example, Fetchable might need to allow IO or ST actions.)	2017-05-09 00:10:29 -07:00
Jarl Christian Berentsen	37e3c9b084	Whitespace	2017-05-05 16:49:27 -07:00
Jarl Christian Berentsen	d153d0aded	Fixed matMul gradients for transposed arguments	2017-05-05 16:49:27 -07:00
Jarl Christian Berentsen	51014a015c	Implemented TileGrad Some notes about static shape inference	2017-05-05 16:49:27 -07:00
Jarl Christian Berentsen	97b4bb5bab	Added reduceSum to Ops	2017-05-05 16:49:27 -07:00
Christian Berentsen	eca4ff8981	Implemented ReluGradGrad and FillGrad (#102 ) Added testReluGrad, testReluGradGrad and testFillGrad	2017-04-30 11:18:06 -07:00
Chris Mckinlay	09c792b84c	added matrix factorization test (#101 )	2017-04-27 17:05:34 -07:00
Judah Jacobson	51c883684b	Clarify the behavior of readValue in a comment. (#99 ) Also add a unit test corresponding to that comments' example code.	2017-04-16 15:31:26 -07:00
Judah Jacobson	42f4fc647e	Add resource-based variable ops. (#98 ) The main difference between these and the `Ref`-bases ops is the explicit `readValue` op. I'm not sure how this should interact with gradients and save/restore, so I'm keeping it as a separate module for now. Once we figure out the details, we can merge it into `TensorFlow.Ops` and replace all uses of the old `Ref`-based ops. (That would also fix #92.) Also replaces our special case newtype `ResourceHandle` to `Tensor Value ResourceHandle`, where `ResourceHandle` is the TF proto corresponding to `DT_RESOURCE`.	2017-04-16 09:24:02 -07:00
Christian Berentsen	21b723d542	Adapt to lts-8.6 and use proto-lens-0.2.0.1 (#97 )	2017-04-11 14:09:01 -07:00
Judah Jacobson	d62c614695	Distinguish between "rendered" and "unrendered" Tensors. (#88 ) Distinguish between "rendered" and "unrendered" Tensors. There are now three types of `Tensor`: - `Tensor Value a`: rendered value - `Tensor Ref a`: rendered reference - `Tensor Build a` : unrendered value The extra bookkeeping makes it easier to track (and enforce) which tensors are rendered or not. For examples where this has been confusing in the past, see With this change, pure ops look similar to before, returning `Tensor Build` instead of `Tensor Value`. "Stateful" (monadic) ops are unchanged. For example: add :: OneOf [..] t => Tensor v'1 t -> Tensor v'2 t -> Tensor Build t assign :: (MonadBuild m, TensorType t) => Tensor Ref t -> Tensor v'2 t -> m (Tensor Ref t) The `gradients` function now requires that the variables over which it's differentiating are pre-rendered: gradients :: (..., Rendered v2) => Tensor v1 a -> [Tensor v2 a] -> m [Tensor Value a] (`Rendered v2` means that `v2` is either a `Ref` or a `Value`.) Additionally, the implementation of `gradients` now takes care to render every intermediate value when performing the reverse accumulation. I suspect this fixes an exponential blowup for complicated expressions.	2017-04-06 15:10:33 -07:00
Judah Jacobson	a11a417ad5	Add another test of CSE and feeds. (#87 ) As a follow-up to #86, check that our CSE isn't too aggressive to prevent feeds of pure ops with distinct names.	2017-03-23 12:58:40 -07:00
Judah Jacobson	fdbfd050f8	Prevent CSE of placeholder ops. (#86 ) The bug was introduced in #84.	2017-03-22 22:47:42 -07:00
Judah Jacobson	c99a23b6a7	Add versions of each op that take optional params as an extra arg. (#84 ) Each op `foo :: ...` now has a corresponding `foo' :: OpParams -> ...` which lets you set optional attributes. `OpParams` is currently a type alias for `OpDef -> OpDef`. In the future we should consider more type safety, e.g., using type-level strings and OverloadedLabels for optional attributes. I used it to replace a few manual `buildOp`s in our code with the codegenerated ops, now that it's easier to set attributes. I also removed `tensorAttr` and `named` since it's now possible to set those op attributes directly. Although this clutters up the API a bit, I think it's simpler than using type classes to implement optional arguments (as in, for example, `Text.Printf`) -- especially in terms of type inference with the rest of the library.	2017-03-20 18:16:38 -07:00
Judah Jacobson	2c5c879037	Introduce a MonadBuild class, and remove `buildAnd`. (#83 ) This change adds a class that both `Build` and `Session` are instances of: class MonadBuild m where build :: Build a -> m a All stateful ops (generated and manually written) now have a signature that returns an instance of `MonadBuild` (rather than just `Build`). For example: assign_ :: (MonadBuild m, TensorType t) => Tensor Ref t -> Tensor v t -> m (Tensor Ref t) This lets us remove a bunch of spurious calls to `build` in user code. It also lets us replace the pattern `buildAnd run foo` with the simpler pattern `foo >>= run` (or `run =<< foo`, which is sometimes nicer when foo is a complicated expression). I went ahead and deleted `buildAnd` altogether since it seems to lead to confusion; in particular a few tests had `buildAnd run . pure` which is actually equivalent to just `run`.	2017-03-18 12:08:53 -07:00
fkm3	4fb68f3aa3	Add example to README + make haddock link more prominent (#60 )	2017-01-16 20:44:45 -08:00
fkm3	f170df9d13	Support fetching storable vectors + use them in benchmark (#50 ) In addition, you can now fetch TensorData directly. This might be useful in scenarios where you feed the result of a computation back in, like RNN. Before: benchmarking feedFetch/4 byte time 83.31 μs (81.88 μs .. 84.75 μs) 0.997 R² (0.994 R² .. 0.998 R²) mean 87.32 μs (86.06 μs .. 88.83 μs) std dev 4.580 μs (3.698 μs .. 5.567 μs) variance introduced by outliers: 55% (severely inflated) benchmarking feedFetch/4 KiB time 114.9 μs (111.5 μs .. 118.2 μs) 0.996 R² (0.994 R² .. 0.998 R²) mean 117.3 μs (116.2 μs .. 118.6 μs) std dev 3.877 μs (3.058 μs .. 5.565 μs) variance introduced by outliers: 31% (moderately inflated) benchmarking feedFetch/4 MiB time 109.0 ms (107.9 ms .. 110.7 ms) 1.000 R² (0.999 R² .. 1.000 R²) mean 108.6 ms (108.2 ms .. 109.2 ms) std dev 740.2 μs (353.2 μs .. 1.186 ms) After: benchmarking feedFetch/4 byte time 82.92 μs (80.55 μs .. 85.24 μs) 0.996 R² (0.993 R² .. 0.998 R²) mean 83.58 μs (82.34 μs .. 84.89 μs) std dev 4.327 μs (3.664 μs .. 5.375 μs) variance introduced by outliers: 54% (severely inflated) benchmarking feedFetch/4 KiB time 85.69 μs (83.81 μs .. 87.30 μs) 0.997 R² (0.996 R² .. 0.999 R²) mean 86.99 μs (86.11 μs .. 88.15 μs) std dev 3.608 μs (2.854 μs .. 5.273 μs) variance introduced by outliers: 43% (moderately inflated) benchmarking feedFetch/4 MiB time 1.582 ms (1.509 ms .. 1.677 ms) 0.970 R² (0.936 R² .. 0.993 R²) mean 1.645 ms (1.554 ms .. 1.981 ms) std dev 490.6 μs (138.9 μs .. 1.067 ms) variance introduced by outliers: 97% (severely inflated)	2016-12-14 18:53:06 -08:00
fkm3	91f508eb5c	Fix TensorData encode and decode for Bool (#49 )	2016-12-12 19:40:32 -08:00
fkm3	cc08520dc7	Fix gradients calculation for min and max (#48 )	2016-12-12 09:47:02 -08:00
Judah Jacobson	1539783ee5	Update type constraints to work around a ghc-8 bug. (#47 ) Also removes all the ghc-8-specific logic in the .cabal files. ghc-8 has issues with deeply nested tuples of constraints. We can work around it by: - Changing TensorTypes to a regular class. This required FlexibleContexts. (But we'll probably need it anyway when we support heterogeneous tensor lists.) - Specializing NoneOf for long type lists. For more details, see: https://ghc.haskell.org/trac/ghc/ticket/12175. Also added 'directory' to tensorflow-core-ops' dependencies since it's used in the Setup script. One more step towards fixing #38.	2016-11-28 21:15:09 -08:00
Judah Jacobson	5b4017e31b	Fix the build on ghc-8.0.1 (#38 ). (#40 ) Two issues: - The definition of `\\` was missing parentheses. It was probably a bug that this used to worked in ghc-7.10. - Set `-fconstraint-solver-iterations=0` to work around https://ghc.haskell.org/trac/ghc/ticket/12175. It looks like we can trigger that bug when defining a significantly complicated op. Specifically, our type shenanigans ("OneOf") along with lens setters (for OpDef) seem to confuse GHC. Still TODO: automate testing of different ghc versions to prevent a regression.	2016-11-21 22:20:08 -08:00
Judah Jacobson	cec666e135	Fix Ref and Build semantics for generated code. (#37 ) Also: - Make TensorFlow.Ops.{variable,assign} be the Core generated versions. - Make ops take "Shape" as mandatory input.	2016-11-21 10:19:15 -08:00
Judah Jacobson	a277c7ddb3	Refactor OpGen. (#36 ) Also fixes op lists when the same attribute specifies the length of both an input and an output. I added a test of "shapeN" which previously failed with the following error: ERROR: Ran out of counts in toResult. Likely misuse of buildListOp.	2016-11-20 10:00:22 -08:00
Greg Steuck	2b5e41ffeb	Make code --pedantic (#35 ) * Enforce pedantic build mode in CI. * Our imports drifted really far from where they should be.	2016-11-18 10:42:02 -08:00
Noon van der Silk	69fdbf677f	test case to show can't calculate grad for embedding (and associated fix) (#23 ) * Fix for embedding gradient calculation - Passes vectors instead of scalars to slice - converts the numRows to a scalar - add `toScalar` utility function - minor change to test case so that it actually works * added lib for testing helper functions * add flatSlice function	2016-11-17 13:54:36 -08:00
fkm3	fc3d398ca9	Optimize fetching (#27 ) * Add MNIST data to gitignore * Add simple tensor round-trip benchmark * Use deepseq + cleaner imports * Use safe version of fromIntegral in FFI code * Don't copy data when fetching tensors BEFORE benchmarking feedFetch/4 byte time 55.79 μs (54.88 μs .. 56.62 μs) 0.998 R² (0.997 R² .. 0.999 R²) mean 55.61 μs (55.09 μs .. 56.11 μs) std dev 1.828 μs (1.424 μs .. 2.518 μs) variance introduced by outliers: 34% (moderately inflated) benchmarking feedFetch/4 KiB time 231.4 μs (221.9 μs .. 247.3 μs) 0.988 R² (0.974 R² .. 1.000 R²) mean 226.6 μs (224.1 μs .. 236.2 μs) std dev 13.45 μs (7.115 μs .. 27.14 μs) variance introduced by outliers: 57% (severely inflated) benchmarking feedFetch/4 MiB time 485.8 ms (424.6 ms .. 526.7 ms) 0.998 R² (0.994 R² .. 1.000 R²) mean 515.7 ms (512.5 ms .. 517.9 ms) std dev 3.320 ms (0.0 s .. 3.822 ms) variance introduced by outliers: 19% (moderately inflated) AFTER benchmarking feedFetch/4 byte time 53.11 μs (52.12 μs .. 54.22 μs) 0.996 R² (0.995 R² .. 0.998 R²) mean 54.64 μs (53.59 μs .. 56.18 μs) std dev 4.249 μs (2.910 μs .. 6.076 μs) variance introduced by outliers: 75% (severely inflated) benchmarking feedFetch/4 KiB time 83.83 μs (82.72 μs .. 84.92 μs) 0.999 R² (0.998 R² .. 0.999 R²) mean 83.82 μs (83.20 μs .. 84.35 μs) std dev 1.943 μs (1.557 μs .. 2.614 μs) variance introduced by outliers: 20% (moderately inflated) benchmarking feedFetch/4 MiB time 95.54 ms (93.62 ms .. 97.82 ms) 0.999 R² (0.998 R² .. 1.000 R²) mean 96.61 ms (95.76 ms .. 97.51 ms) std dev 1.408 ms (1.005 ms .. 1.889 ms)	2016-11-17 10:41:49 -08:00
Greg Steuck	0d4f5a9628	Added sessionTracer to log graph operations. (#26 ) * Added TracingTest.	2016-11-14 15:14:51 -08:00
Greg Steuck	d9115c716f	genericLength is too generic. Avoid folding in TF.	2016-11-09 14:20:26 -08:00
Greg Steuck	ec5c5228e1	Fixed #19 by adding previously missing reshape. The comment did say that only flat shapes were supported though.	2016-11-09 11:54:53 -08:00
silky	9c81241439	Tests for "embedding_lookup" and minor fix - added a test that fails for a partitioned embedding - added a test that passes for a single embedding	2016-11-09 16:21:40 +11:00
Greg Steuck	4ec78a8fca	Replaced topK with topKV2. (#21 ) topK is obsolete and generating warnings.	2016-11-08 20:57:22 -08:00
fkm3	03a3a6d086	Misc MNIST example cleanup (#9 ) * Use native oneHot op in the example code. It didn't exist when this was originally written. * Misc cleanup in MNIST example - Use unspecified dimension for batch size in model. This simplifies the code for the test set. - Move error rate calculation into model.	2016-10-26 11:14:38 -07:00

1 2

51 commits