Problem when searching for PDFs with Indexing Service in ASP-solution.

 
 
Martin Emanuelsson
      10-15-2004
Hello,

I have a problem with a small ASP solution that searches PDF documents using
Indexing Service.

For some files, the search result returns gibberish such as this:
************************************************** ****
I$OYDURSURGXFWVSURGXFHGLQ0H[LFR/DERUDWRU\5HSRUW/DERUDWRU\2UGHUHU5HVSRQVLEOH6
WDWXV)HPLQLQH*27-RKDQVVRQ6XVDQQH)LQDO'DWH)LQDO3URMHFW3URMHFW1DPH&RV WSODFH9HU
1R$9$523'36XPPDU\7KHUHVXOWV5XQ2II7KHSURGXFWVWKDWZH UHSURGXFHGZHUHEDG7KHVXUIDF
HPDWHULDOZDVK\GURSKRELFDQGDOOWKHSURGXFWVKDGUXQRII6 HHSLFWXUH7KHSURGXFWVWKDWSU
RGXFHGZHUHJRRG,WZDVWKHVDPHSURGXFWVWKDWSURGXFHGEXWZ LWKVSXQERQG%XURSHVXUIDFHPD
WHULDO7KHSURGXFWVKDGIDVWLQOHWJRRGVSUHDGLQJLQWKHFRU HDQGQRUXQRII'RVLPDW7KHSURG
XFWSURGXFHGZDVEDG6HYHUDORIWKHSURGX
************************************************** ****

while other files return "good text" like this:
************************************************** ****
Feminine 865106-Date Final Projectname Orderer 2004-06-02 ALVARO PDP
Johansson Susanne Distributed to: Internal test Alvaro v. 20-21 Summary
Mission Background Comments Conclusion Test methods Test objects Sample No:
20040527-001-01 Alvaro Labrep 2_2.rep SEBJOIS 2004-03-17 Printed by:
labreporter 2004-06-02 15:51:51Laboratory Report No:20040527-001 Rev: 1
Status:Final Brand /Name SABA Ultr
************************************************** ****

The only difference I can find between these files is that they seem to have
been saved with different PDF versions (according to File --> Document
Properties for each file).

The "bad" file has the following information there:
Creator: Windows NT 4.0
Producer: Acrobat Distiller Daemon 3.01 for HP-UX A.09.01 and later (HPPA)
PDF version: 1.1 (Acrobat 2.x)

The "good" file has the following information:
Creator: AdobePS5.dll Version 5.1.2
Producer: Acrobat Distiller 4.0 for Windows
PDF version: 1.3 (Acrobat 4.x)
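
To sort a larger set of documents into those that match the "bad" profile and those that match the "good" one, the version can also be read from the file itself, since a PDF normally begins with a "%PDF-x.y" header. The following is only a minimal Python sketch under that assumption: the function names pdf_header_version and pdf_file_version are made up for illustration, the header is assumed to sit within the first 1024 bytes, and a matching version number alone does not prove that a file will index badly.

```python
import re
from typing import Optional

# A PDF normally begins with a header such as "%PDF-1.3".
_HEADER = re.compile(rb"%PDF-(\d+\.\d+)")


def pdf_header_version(data: bytes) -> Optional[str]:
    """Return the version from a '%PDF-x.y' header, or None if absent."""
    match = _HEADER.search(data[:1024])
    return match.group(1).decode("ascii") if match else None


def pdf_file_version(path: str) -> Optional[str]:
    """Read the start of a file (hypothetical path) and return its version."""
    with open(path, "rb") as handle:
        return pdf_header_version(handle.read(1024))


print(pdf_header_version(b"%PDF-1.1\n"))  # prints: 1.1
```

Files reporting 1.1 would then be the first candidates to test against the index or to re-create in the newer format.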

A small part of the code looks like this:
************************************************** ****
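' Connect to Indexing Service through the MSIDXS OLE DB provider
Set objConnection = Server.CreateObject("ADODB.Connection")
Set objIndex = Server.CreateObject("ADODB.Recordset")
objConnection.ConnectionString = "Provider=MSIDXS;"
objConnection.Open

' The search condition after WHERE is not included in this excerpt
strSQL = "SELECT Characterization, Filename, Path " & _
         "FROM se_got_data.limspdf..SCOPE() WHERE "

objIndex.Open strSQL, objConnection

' Write out the abstract (Characterization) of each hit
Do Until objIndex.EOF
    Response.Write objIndex("Characterization")
    objIndex.MoveNext
Loop

' Release the recordset and the connection
objIndex.Close
Set objIndex = Nothing
objConnection.Close
Set objConnection = Nothing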
************************************************** ****

The problem seems to lie in the Characterization property returned for PDFs
of the earlier version. Has anyone experienced anything like this before?

Best regards
Martin Emanuelsson
Gothenburg, Sweden


 
me
      10-15-2004
You need to install the Plugin. Acquire it from Adobe.



 
Hilary Cotter
      10-15-2004
Could you post these problem documents here?

--
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html



 
Martin Emanuelsson
      10-15-2004
If you mean the Adobe PDF IFilter 5.0 plugin, it is already installed on the
server, so that is not the problem, unless there is some setting that needs
to be configured for the plugin?

/Martin



 
Martin Emanuelsson
      10-15-2004
I have to check with the business people where I work whether that is OK or
whether the documents are confidential. I'll get back to you as soon as
possible.

/Martin



 
Martin Emanuelsson
      10-18-2004
I tried posting two test files to this newsgroup but got an error message
saying the message was too big (two attachments with a total size of about
130 KB).

I could send them directly to you if that's OK, Hilary? And to anyone else
who is interested, for that matter.

/Martin



 
Hilary Cotter
      10-18-2004
sure.



 
Hilary Cotter
      10-18-2004
The gibberish is in the documents themselves.

There isn't a whole lot you can do, other than convert all the docs to the
"good" format. You might want to talk to Adobe.
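
One observation on the gibberish sample from the first post: every character appears to sit exactly 29 code points below the letter it stands for ("$OYDUR" becomes "Alvaro", "0H[LFR" becomes "Mexico"). That would be consistent with the older Distiller output using a font encoding that the filter cannot map back to normal text, but that cause is an assumption and is not confirmed anywhere in this thread. A minimal Python sketch of the shift follows; the function name shift_decode is made up for illustration, and treating embedded spaces as line-wrap artifacts is also an assumption.

```python
def shift_decode(text: str, offset: int = 29) -> str:
    """Shift every non-space character up by `offset` code points."""
    # Assumption: spaces inside the sample are line-wrap artifacts, so skip them.
    return "".join(chr(ord(ch) + offset) for ch in text if not ch.isspace())


# Part of the first gibberish line quoted in the original post.
sample = r"$OYDURSURGXFWVSURGXFHGLQ0H[LFR/DERUDWRU\5HSRUW"
print(shift_decode(sample))
# prints: AlvaroproductsproducedinMexicoLaboratoryReport
```

Word spaces do not come back this way (a space shifted down by 29 is a control character, which was presumably dropped along the way), so this only helps confirm the pattern. It is not a fix for what Indexing Service stores.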

--
Hilary Cotter
Looking for a SQL Server replication book?
http://www.nwsu.com/0974973602.html



 