INTERACT FORUM

More => Old Versions => Media Center 11 (Development Ended) => Topic started by: KingSparta on April 10, 2005, 05:21:50 pm

Title: DOC & Text In MC
Post by: KingSparta on April 10, 2005, 05:21:50 pm
Today I loaded Up 46,630 Text Files Info Media Center And Was Wondering If There Was A Way To Search The Doc Files (The Content Of The Doc File)?
Title: Re: DOC & Text In MC
Post by: JimH on April 10, 2005, 07:37:30 pm
I think the search feature works.  Not sure.
Title: Re: DOC & Text In MC
Post by: IanG on April 11, 2005, 02:36:43 am
I think the search feature works.  Not sure.

The search doesn't include file contents.

Ian G.
Title: Re: DOC & Text In MC
Post by: KingSparta on April 11, 2005, 12:56:49 pm
It would be nice if if you can select a number of docs and do a search and MC would do the search in the document body.

search parm:

Doc="The quick brown fox jumps over the lazy dog"

and this would search any text type file (HTML, Txt, Doc etc...)
Title: Re: DOC & Text In MC
Post by: Matt on April 11, 2005, 01:07:03 pm
Use [Text]= to search inside the document.

You can make this a default search field so you don't need to type [Text]= in Options > Library.
Title: Re: DOC & Text In MC
Post by: KingSparta on April 11, 2005, 02:13:01 pm
that works, sort of

it does not return all files with a search string it only returns some of the files.

i typed: text=a

it returned 145 files

i know there is more files out of the 46,000 files with a letter "A"

i used other search parms as well with the same results

can I force it to return all?
Title: Re: DOC & Text In MC
Post by: KingSparta on April 11, 2005, 07:20:39 pm
if i search "text="

using "a" only 145 files out of 46,000+ come back as results

using "e" only 145 files out of 46,000+ come back as results

using "and" only 145 files out of 46,000+ come back as results
Title: Re: DOC & Text In MC
Post by: JimH on April 11, 2005, 08:09:27 pm
if i search "text="
Did you use the square brackets?

[text]=

It's awkward, but that's how it works for now.

You may not have text strings in the tags.  You can check by adding the "text" field to the view.

I had to do "update tags from library.

It isn't working right yet, but it does work.
Title: Re: DOC & Text In MC
Post by: KingSparta on April 12, 2005, 05:55:47 am
Quote
Did you use the square brackets?

[text]=

I did like matt said and put "text=" in the options>library

but yes i did

i did add the field "text" to view there is nothing In "Text"

i guess when i did the orginal import nothing was added for some reason.

I did notice this bug in import:

1. if the import folders are checked
2. you uncheck them
3. add a new folder
4. the folders that you unchecked are now checked again

Title: Re: DOC & Text In MC
Post by: JimH on April 12, 2005, 07:07:01 am
i did add the field "text" to view there is nothing In "Text"
That's the problem.  Maybe it was broken in some previous build or during conversion.  Try the update Library from properties (I think).
Title: Re: DOC & Text In MC
Post by: Alex B on April 12, 2005, 07:21:15 am
Quote
Try the update Library from properties...

A word of warning: The contents of the Name field will be lost. MC replaces the information with the filename.
Title: Re: DOC & Text In MC
Post by: IanG on April 12, 2005, 07:58:30 am
A word of warning: The contents of the Name field will be lost. MC replaces the information with the filename.

I've used "Update library from tags", which populates the name field from the title property in an MS Office doc, but where's "update library from properties"? 

Ian G.
Title: Re: DOC & Text In MC
Post by: Alex B on April 12, 2005, 08:04:07 am
Jim meant "Update Library (from tags)", but if there are no file tags containing metadata that MC can understand it picks up the filenames.
Title: Re: DOC & Text In MC
Post by: KingSparta on April 12, 2005, 11:20:50 am
Jim meant "Update Library (from tags)", but if there are no file tags containing metadata that MC can understand it picks up the filenames.

Correct.

At any rate i may have brought MC down to it's Knees (or my computer)

i left it updating the library from tags this morning

i just came home to check on it and it basicly stopped with many error messages that had to be clicked away.

windows also gave me warnings of out of Swap space so it increased it for me (so it said). I may need more memory (512 megs in here now).

if you try to sort "Text" forget it MJ goes into lala land for sometime (about a 6 pack)

trying to do a search renders the same effect, (i may need a main frame)

starting MC took 2 mins


so i rebooted I may try again later but i have a project going on
Title: Re: DOC & Text In MC
Post by: IanG on April 13, 2005, 02:58:19 am
It looks like MC can find strings in text files (.txt, .htm etc) but it can't find them in the body of MS Office files (my idea of a "document") - only the title is searched.  Is this the expected behaviour, and are there any plans to include Office files?

Ian G.
Title: Re: DOC & Text In MC
Post by: Tab on April 14, 2005, 11:03:45 am
I havent tried working with docs in MC yet. But if theres one thing that would be most useful for a search function, its the ability to search for any incidence of 2 separate strings on one line.

Tab
Title: Re: DOC & Text In MC
Post by: KingSparta on April 14, 2005, 12:01:30 pm
It looks like MC can find strings in text files (.txt, .htm etc) but it can't find them in the body of MS Office files (my idea of a "document") - only the title is searched.  Is this the expected behaviour, and are there any plans to include Office files?

Ian G.

I think it will not import html files into the "Text" field

at least on mine it is not showing any text.

this field also can not be edited.

I am not sure reading all of the text into the database is the correct answer on how to deal with this issue with searching text\doc files.

it may take more time but scaning the marked files maybe better since it will not read all of them into memory or into the database.

I am also sure this is not #1 on J rivers we just got to do this, or i will die list.
Title: Re: DOC & Text In MC
Post by: IanG on April 14, 2005, 05:21:22 pm
I think it will not import html files into the "Text" field


It's inconsistent on my PC - search strings are found in some .html files but not in others.

Personaly, I'd prefer to see the indexing in the db - I'd rather take the hit of increased disk use and import time than have to wait for files to be searched. 

Ian G.