Mahout, mail # dev - Review Request: New implementation for LDA: Collapsed Variational Bayes (0th derivative approximation), with map-side model caching


Re: Review Request: New implementation for LDA: Collapsed Variational Bayes (0th derivative approximation), with map-side model caching
Jake Mannix 2011-12-01, 03:44

-----------------------------------------------------------
This is an automatically generated e-mail. To reply, visit:
https://reviews.apache.org/r/2944/
-----------------------------------------------------------

(Updated 2011-12-01 03:44:25.140987)
Review request for mahout and Ted Dunning.

Changes
-------

VectorDumper becomes a "top-terms" dumper as well.

Summary
-------

See MAHOUT-897

This addresses bug MAHOUT-897.
    https://issues.apache.org/jira/browse/MAHOUT-897

Diffs (updated)
-----

  trunk/core/src/main/java/org/apache/mahout/clustering/lda/LDADriver.java 1208933
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/LDASampler.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CVB0DocInferenceMapper.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CVB0Driver.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CVB0TopicTermVectorNormalizerMapper.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CachingCVB0Mapper.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/CachingCVB0PerplexityMapper.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/InMemoryCollapsedVariationalBayes0.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/ModelTrainer.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/clustering/lda/cvb/TopicModel.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/common/MemoryUtil.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/common/Pair.java 1208933
  trunk/core/src/main/java/org/apache/mahout/math/DistributedRowMatrixWriter.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/math/MatrixUtils.java PRE-CREATION
  trunk/core/src/main/java/org/apache/mahout/math/stats/Sampler.java PRE-CREATION
  trunk/core/src/test/java/org/apache/mahout/clustering/ClusteringTestUtils.java 1208933
  trunk/core/src/test/java/org/apache/mahout/clustering/lda/TestMapReduce.java 1208933
  trunk/core/src/test/java/org/apache/mahout/clustering/lda/cvb/TestCVBModelTrainer.java PRE-CREATION
  trunk/core/src/test/java/org/apache/mahout/math/stats/SamplerTest.java PRE-CREATION
  trunk/integration/src/main/java/org/apache/mahout/utils/vectors/VectorDumper.java 1208933
  trunk/integration/src/main/java/org/apache/mahout/utils/vectors/VectorHelper.java 1208933
  trunk/math/src/main/java/org/apache/mahout/math/AbstractVector.java 1208933
  trunk/math/src/main/java/org/apache/mahout/math/NamedVector.java 1208933
  trunk/src/conf/driver.classes.props 1208933

Diff: https://reviews.apache.org/r/2944/diff

Testing
-------

mvn clean test

Thanks,

Jake
