read word doc into string

Aug 30, 2012 at 3:47 PM

Hi all i have the following code 

 public string ReadAllTextFromWordDocFile(string fileName)       
using (StreamReader streamReader = new StreamReader(fileName))
var document = new HWPFDocument(streamReader.BaseStream);           
//var document2 = new XWPFDocument(streamReader.BaseStream);           
var wordExtractor = new WordExtractor(document);       
var docText = new StringBuilder()
foreach (string text in wordExtractor.ParagraphText)       
return docText.ToString();   

I have also referenced the following

using NPOI;using

but I get the error The type or namespace name 'HWPFDocument' could not be found

can anyone explain where im going wrong?


Sep 15, 2012 at 11:14 PM

HWPFDocument is not included in any official release because HWPF is very unstable so far. Anyway, you can find the source code from ScratchPad and compile by yourself.

Jul 2, 2013 at 8:22 AM
not included?
It is mentioned in a post at
can't understand what is meant by following "Anyway, you can find the source code from ScratchPad and compile by yourself."
Is the library supports these or not
Nov 18, 2013 at 12:20 AM
It supports it but not in official release. ScratchPad namespace is for test purpose. Any source code in it is not stable.
Apr 15, 2014 at 2:37 PM
Hi, is HWPFDocument going to be included in the official release at some point, or is there another way we can use NPOI to extract text from word 2003 documents?
Apr 23, 2014 at 6:53 AM
We are going to include it in NPOI 2.3. But we need to make it stabler first.