Hi,

I had this error during crawling a Samba hosted on Ubuntu Server :
ERROR 2017-10-05 00:00:14,109 (Idle cleanup thread) - MCF|MCF-agent|apache.manifoldcf.crawlerthreads|Exception tossed: Service '_ANON_0' of type '_REPOSITORYCONNECTORPOOL_SmbFileShare' is not active
org.apache.manifoldcf.core.interfaces.ManifoldCFException: Service '_ANON_0' of type '_REPOSITORYCONNECTORPOOL_SmbFileShare' is not active
at org.apache.manifoldcf.core.lockmanager.BaseLockManager.updateServiceData(BaseLockManager.java:273)
at org.apache.manifoldcf.core.lockmanager.LockManager.updateServiceData(LockManager.java:108)
at org.apache.manifoldcf.core.connectorpool.ConnectorPool$Pool.pollAll(ConnectorPool.java:654)
at org.apache.manifoldcf.core.connectorpool.ConnectorPool.pollAllConnectors(ConnectorPool.java:338)
at org.apache.manifoldcf.crawler.repositoryconnectorpool.RepositoryConnectorPool.pollAllConnectors(RepositoryConnectorPool.java:124)
at org.apache.manifoldcf.crawler.system.IdleCleanupThread.run(IdleCleanupThread.java:68)

I used MCF 2.8.1 on Debian 8 with Postgresql 9.5.3, Windows Share repository connector. The job was configured to process about 2 millions of files  (600 GB).
For text extraction I used a Tika server (on the same server as MCF) and add the Tika external content extractor transformation connector into the job configuration.
The error was present 9 hours after the job was launched. The status job still indicated that the job was running but there was only 1 document in the active column and the error above was repeated in the MCF log.

Then I tried to launch the clean-lock.sh script and I obtained this error :
WARN 2017-10-09 08:23:56,284 (Idle cleanup thread) - MCF|MCF-agent|apache.manifoldcf.lock|Attempt to set file lock 'mcf/mcf_home/./syncharea/551/442/lock-_POOLTARGET__REPOSITORYCONNECTORPOOL_SmbFileShare.lock' failed: No such file or directory
java.io.IOException: No such file or directory
at java.io.UnixFileSystem.createFileExclusively(Native Method)
at java.io.File.createNewFile(File.java:1012)
at org.apache.manifoldcf.core.lockmanager.FileLockObject.grabFileLock(FileLockObject.java:223)
at org.apache.manifoldcf.core.lockmanager.FileLockObject.obtainGlobalWriteLockNoWait(FileLockObject.java:78)
at org.apache.manifoldcf.core.lockmanager.LockObject.obtainGlobalWriteLock(LockObject.java:121)
at org.apache.manifoldcf.core.lockmanager.LockObject.enterWriteLock(LockObject.java:74)
at org.apache.manifoldcf.core.lockmanager.LockGate.enterWriteLock(LockGate.java:177)
at org.apache.manifoldcf.core.lockmanager.BaseLockManager.enterWrite(BaseLockManager.java:1120)
at org.apache.manifoldcf.core.lockmanager.BaseLockManager.enterWriteLock(BaseLockManager.java:757)
at org.apache.manifoldcf.core.lockmanager.LockManager.enterWriteLock(LockManager.java:302)
at org.apache.manifoldcf.core.connectorpool.ConnectorPool$Pool.pollAll(ConnectorPool.java:585)
at org.apache.manifoldcf.core.connectorpool.ConnectorPool.pollAllConnectors(ConnectorPool.java:338)
at org.apache.manifoldcf.crawler.repositoryconnectorpool.RepositoryConnectorPool.pollAllConnectors(RepositoryConnectorPool.java:124)
at org.apache.manifoldcf.crawlerui.IdleCleanupThread.run(IdleCleanupThread.java:69)
And the error was repeated indefinitely in the log.

Did it mean that there was a problem with the syncharea folder at some point ?

Thanks,
Best regards,

Olivier TAVARD
NEW: Monitor These Apps!
elasticsearch, apache solr, apache hbase, hadoop, redis, casssandra, amazon cloudwatch, mysql, memcached, apache kafka, apache zookeeper, apache storm, ubuntu, centOS, red hat, debian, puppet labs, java, senseiDB