Any way to index the contents of a PDF file with WebCatalog?

This WebDNA talk-list message is from

2000


It keeps the original formatting.
numero = 31859
interpreted = N
texte = Hello,I am trying to work on a catalog of documents type site. the majority of documents are pdf files.The site will allow people to upload their documents and enter some basic information for sorting purposes. I need a way to index the content of the pdf files and put into a WebCatalog database so I can do free text searches as well.Does anyone know of any way to get the text portion of a pdf file into a WebCatalog database short of copy and paste (which is not an option for this project).The end site will be on Solaris (if we can get WebCatalog to stay running for more than a few minutes). I will be doing most of the development on Macintosh (and yes, I am watching case of the filenames ). Of course if any part of the process to get the data from the pdf file requires being done on Solaris only all of the development will move to that platform.Thank you.-- Dale Therio +49 69 263 19977 office Dresdner Kleinwort Benson Research +49 69 263 11379 fax Jürgen-Ponto-Platz 1 +49 170 934 3610 mobile 60301 Frankfurt, Germany------------------------------------------------------------- This message is sent to you because you are subscribed to the mailing list . To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://search.smithmicro.com/ Associated Messages, from the most recent to the oldest:

    
  1. Any way to index the contents of a PDF file with WebCatalog? (dale@gmr.dresdner.net 2000)
Hello,I am trying to work on a catalog of documents type site. the majority of documents are pdf files.The site will allow people to upload their documents and enter some basic information for sorting purposes. I need a way to index the content of the pdf files and put into a WebCatalog database so I can do free text searches as well.Does anyone know of any way to get the text portion of a pdf file into a WebCatalog database short of copy and paste (which is not an option for this project).The end site will be on Solaris (if we can get WebCatalog to stay running for more than a few minutes). I will be doing most of the development on Macintosh (and yes, I am watching case of the filenames ). Of course if any part of the process to get the data from the pdf file requires being done on Solaris only all of the development will move to that platform.Thank you.-- Dale Therio +49 69 263 19977 office Dresdner Kleinwort Benson Research +49 69 263 11379 fax Jürgen-Ponto-Platz 1 +49 170 934 3610 mobile 60301 Frankfurt, Germany------------------------------------------------------------- This message is sent to you because you are subscribed to the mailing list . To unsubscribe, E-mail to: To switch to the DIGEST mode, E-mail to Web Archive of this list is at: http://search.smithmicro.com/ dale@gmr.dresdner.net

DOWNLOAD WEBDNA NOW!

Top Articles:

Talk List

The WebDNA community talk-list is the best place to get some help: several hundred extremely proficient programmers with an excellent knowledge of WebDNA and an excellent spirit will deliver all the tips and tricks you can imagine...

Related Readings:

Credit card processing options. . . (1997) [WebDNA] cookie expiration date (2015) MasterCounter - Does this work?? (1999) Country & Ship-to address & other fields ? (1997) WebCat2b13MacPlugIn - [showif][search][/showif] (1997) show all problem (1997) instant email reply (2001) creating a ShipCosts database (1997) WebCat2b15MacPlugIn - [authenticate] not [protect] (1997) Multiple Ad databases? (1997) WebCatb15 Mac CGI -- [purchase] (1997) WebCatalog 3.0.8 is on FTP... (2000) Rhapsody? (1997) two unique banners on one page (1997) Giving out error pages (1997) quotes and truncating? (1997) Middle Context (2002) [LOOKUP] (1997) where to put code (1998) Re:Searching for ALL / empty form field *the FINAL answer* (1997)