INTERACT FORUM

Please login or register.

Login with username, password and session length
Advanced search  
Pages: [1]   Go Down

Author Topic: Definite Articles (non English ones)  (Read 31361 times)

AndrewFG

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 3392
Definite Articles (non English ones)
« on: November 03, 2013, 06:00:34 am »

MC has an option to exclude definite article prefixes ("the", "a") from the sort order on various lists -- but it applies to only English definite articles -- which is not very friendly for MC's many international users!!

I think the article list must include Der, Die, Das, Ein, Eine, Le, La, Les, Un, Une, El, Los, Las, Il (to cite just a few languages), plus also prefixes that aren't strictly definite articles like Sir ("Sir Colin Davis")

So I request that the Options setting should be extended to let the user edit their own list of definite articles to exclude from sorts; the easiest way would be to enter a (comma) separated list.

Logged
Author of Whitebear Digital Media Renderer Analyser - http://www.whitebear.ch/dmra.htm
Author of Whitebear - http://www.whitebear.ch/mediaserver.htm

adamt

  • Galactic Citizen
  • ****
  • Posts: 447
Re: Definite Articles (non English ones)
« Reply #1 on: November 03, 2013, 03:02:49 pm »

I agree, this would be a helpful addition considering the many languages MC supports. 
Logged
Adam Thompson, JRiver

InflatableMouse

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 3978
Re: Definite Articles (non English ones)
« Reply #2 on: November 03, 2013, 03:43:06 pm »

De, Het, Een .. to add some dutch articles :P.
Logged

Claude Lapalme

  • World Citizen
  • ***
  • Posts: 233
Re: Definite Articles (non English ones)
« Reply #3 on: November 04, 2013, 11:42:15 am »

+1
Logged

fitbrit

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 4877
Re: Definite Articles (non English ones)
« Reply #4 on: November 04, 2013, 12:52:36 pm »

My list: French, Spanish and German at least, please.
Logged

Matt

  • Administrator
  • Citizen of the Universe
  • *****
  • Posts: 41919
  • Shoes gone again!
Re: Definite Articles (non English ones)
« Reply #5 on: November 04, 2013, 02:05:33 pm »

I'm not a fan of ignoring articles because there are too many corner cases, exceptions, etc.  My brain doesn't have the horsepower for all those rules.

But for the people that can handle it, I empathize with the request to internationalize the system.

That still leaves a big question: when you run MC in English, is 'Das Auto' under A or D?  In other words, are internationalized articles tied to the current language or is there a master list of all articles and users are expected to know that der, die, das, el, etc. are articles even if they're running in English?
Logged
Matt Ashland, JRiver Media Center

mwillems

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 5173
  • "Linux Merit Badge" Recipient
Re: Definite Articles (non English ones)
« Reply #6 on: November 04, 2013, 02:20:10 pm »

I'm not a fan of ignoring articles because there are too many corner cases, exceptions, etc.  My brain doesn't have the horsepower for all those rules.

But for the people that can handle it, I empathize with the request to internationalize the system.

That still leaves a big question: when you run MC in English, is 'Das Auto' under A or D?  In other words, are internationalized articles tied to the current language or is there a master list of all articles and users are expected to know that der, die, das, el, etc. are articles even if they're running in English?

I was wondering that myself; many English-speaking musicians use an article from another language and it just becomes a part of their band name (Les Savy Fav is always shelved in "L" at the record store).  That said, it would drive me nuts if I couldn't ignore English articles (like in the windows file system).  

Maybe the easiest solution is to allow the user to specify the articles he or she would like ignored (as OP suggested), with the default setting being the current arrangement?  Then all languages are (theoretically) supported and folks can choose the articles most relevant to them (I personally wouldn't want to exclude honorifics like "Sir" as OP suggested, but with this system you could if you wanted).
Logged

gappie

  • Regular Member
  • Citizen of the Universe
  • *****
  • Posts: 4565
Re: Definite Articles (non English ones)
« Reply #7 on: November 04, 2013, 03:09:51 pm »

I'm not a fan of ignoring articles because there are too many corner cases, exceptions, etc.  My brain doesn't have the horsepower for all those rules.

But for the people that can handle it, I empathize with the request to internationalize the system.

That still leaves a big question: when you run MC in English, is 'Das Auto' under A or D?  In other words, are internationalized articles tied to the current language or is there a master list of all articles and users are expected to know that der, die, das, el, etc. are articles even if they're running in English?
I think it would not make sense to couple it to the language settings in mc. my mc is in English (I think computers talk English), but besides having a bunch of English movies I have a lot of dutch, german and spanisch movies. I would love to ignore all the articles in those languages, not just because it looks better, it makes scrolling through the library easier.. now the letters d (die der dem das de ) and l (la le les), and others, are overcrowded.

but then again, all the articles of all languages.. not sure what will happen than. :)

but something in the options where you can add your own articles.

 :)
gab
Logged

AndrewFG

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 3392
Re: Definite Articles (non English ones)
« Reply #8 on: November 04, 2013, 03:41:40 pm »

That still leaves a big question: when you run MC in English, is 'Das Auto' under A or D?  In other words, are internationalized articles tied to the current language or is there a master list of all articles and users are expected to know that der, die, das, el, etc. are articles even if they're running in English?

You just proved my point! This is exactly the reason why the setting must be freely user customizable...

For me, (I have some knowledge of languages), I want to see

  • Die Fledermaus under "F"
  • La Traviata under "T"
  • Sir Paul McCartney under "P"
  • Les Pecheurs de Perle under "P"
  • And... admittedly... Les Paul under "L"

...but other people might see this differently...

Therefore I ask that Options shall give the user a text edit field where they can personally enter a space (or comma) delimited list of all definite articles that they personally wish to exclude.

Logged
Author of Whitebear Digital Media Renderer Analyser - http://www.whitebear.ch/dmra.htm
Author of Whitebear - http://www.whitebear.ch/mediaserver.htm

Matt

  • Administrator
  • Citizen of the Universe
  • *****
  • Posts: 41919
  • Shoes gone again!
Re: Definite Articles (non English ones)
« Reply #9 on: November 04, 2013, 03:44:19 pm »

  • Les Pecheurs de Perle under "P"
  • And... admittedly... Les Paul under "L"

My brain just melted :P

I can't imagine how you'd do that without cratering performance.  String compares for sorting is incredibly mission critical for performance, so we have to be careful.
Logged
Matt Ashland, JRiver Media Center

JimH

  • Administrator
  • Citizen of the Universe
  • *****
  • Posts: 71333
  • Where did I put my teeth?
Re: Definite Articles (non English ones)
« Reply #10 on: November 04, 2013, 03:55:15 pm »

My brain just melted

Matt's brain:

Logged

mojave

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 3732
  • Requires "iTunes or better" so I installed JRiver
Re: Definite Articles (non English ones)
« Reply #11 on: November 04, 2013, 04:03:14 pm »

Is this Matt?  ;D

Logged

fitbrit

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 4877
Re: Definite Articles (non English ones)
« Reply #12 on: November 04, 2013, 04:16:47 pm »

I see Matt's point. What if an article in one language is a regular word in another?
"Die Another Day"
Logged

AndrewFG

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 3392
Re: Definite Articles (non English ones)
« Reply #13 on: November 04, 2013, 04:21:49 pm »

My brain just melted :P

I can't imagine how you'd do that without cratering performance.  String compares for sorting is incredibly mission critical for performance, so we have to be careful.

Hmmm. (I know what you mean. But nevertheless you have to admit that it is a verging on wimpy answer. Real coders don't ...)

Your (ex) competition did it fine (see image attached). And their application is written in Perl (albeit admittedly compiled rather than interpreted)...

PS I don't know how the innards of MC works, but do you really search and index your lists in real time on demand? Or do you prepare in advance a lookup / index / hash, which obviously would be much faster...

PPS and anyway you are already searching for a few prefix words (The, A, An) aren't you, so what's the big deal if you have to search for a few more?

PPPS (below the belt warning) why don't you ask Hendrik for his feedback on this ;)

Logged
Author of Whitebear Digital Media Renderer Analyser - http://www.whitebear.ch/dmra.htm
Author of Whitebear - http://www.whitebear.ch/mediaserver.htm

AndrewFG

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 3392
Re: Definite Articles (non English ones)
« Reply #14 on: November 04, 2013, 04:25:35 pm »

I see Matt's point. What if an article in one language is a regular word in another?
"Die Another Day"

Indeed. That is why I say it must be a user configurable option !!
Logged
Author of Whitebear Digital Media Renderer Analyser - http://www.whitebear.ch/dmra.htm
Author of Whitebear - http://www.whitebear.ch/mediaserver.htm

Matt

  • Administrator
  • Citizen of the Universe
  • *****
  • Posts: 41919
  • Shoes gone again!
Re: Definite Articles (non English ones)
« Reply #15 on: November 04, 2013, 04:31:42 pm »

Your (ex) competition did it fine (see image attached). And their application is written in Perl (albeit admittedly compiled rather than interpreted)...

You said:
Les Pecheurs de Perle > P
Les Paul > L

How does your screenshot with a list of articles accomplish that?
Logged
Matt Ashland, JRiver Media Center

AndrewFG

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 3392
Re: Definite Articles (non English ones)
« Reply #16 on: November 04, 2013, 04:36:39 pm »

You said:
Les Pecheurs de Perle > P
Les Paul > L
How does your screenshot with a list of articles accomplish that?

It doesn't :)

That was my aside comment, just to prove that there are no hard answers...

I would like to have a choice, based on what I know is in my library, whether or not (for example) the word Les should be included or excluded from the sort order...

Logged
Author of Whitebear Digital Media Renderer Analyser - http://www.whitebear.ch/dmra.htm
Author of Whitebear - http://www.whitebear.ch/mediaserver.htm

Matt

  • Administrator
  • Citizen of the Universe
  • *****
  • Posts: 41919
  • Shoes gone again!
Re: Definite Articles (non English ones)
« Reply #17 on: November 04, 2013, 04:58:39 pm »

Next build:
NEW: The option to move articles, ignore articles, etc. uses a list that includes English, Spanish, German, and French articles.
NEW: Added the ability for 'Clean File Properties' to move articles back to the front of a value (i.e. change 'Beatles, The' to 'The Beatles').

Just doing this change was a fair bit of work, because some of the article functions were hand-tuned with regard to how they checked for articles.  They needed to be generalized without sacrificing (much) performance.

Someday later we'll talk about exceptions, allowing user customization, etc.
Logged
Matt Ashland, JRiver Media Center

fitbrit

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 4877
Re: Definite Articles (non English ones)
« Reply #18 on: November 04, 2013, 05:53:22 pm »

Next build:
NEW: The option to move articles, ignore articles, etc. uses a list that includes English, Spanish, German, and French articles.
NEW: Added the ability for 'Clean File Properties' to move articles back to the front of a value (i.e. change 'Beatles, The' to 'The Beatles').

Just doing this change was a fair bit of work, because some of the article functions were hand-tuned with regard to how they checked for articles.  They needed to be generalized without sacrificing (much) performance.

Someday later we'll talk about exceptions, allowing user customization, etc.

Thanks, Matt. About half a dozen of my files will be organised better than ever.
Logged

AndrewFG

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 3392
Re: Definite Articles (non English ones)
« Reply #19 on: November 05, 2013, 02:03:03 am »

Next build:
NEW: The option to move articles, ignore articles, etc. uses a list that includes English, Spanish, German, and French articles.
NEW: Added the ability for 'Clean File Properties' to move articles back to the front of a value (i.e. change 'Beatles, The' to 'The Beatles').

Just doing this change was a fair bit of work, because some of the article functions were hand-tuned with regard to how they checked for articles.  They needed to be generalized without sacrificing (much) performance.

Mega!  Many thanks for this.

Someday later we'll talk about exceptions, allowing user customization, etc.

One way to solve this is to use the non breaking space delimiter (Unicode 00A0) instead of the breaking space delimiter (Unicode 0020).

You could use 00A0 to separate the words in Die Hard, and 0020 to separate the words in Die Fledermaus...

Logged
Author of Whitebear Digital Media Renderer Analyser - http://www.whitebear.ch/dmra.htm
Author of Whitebear - http://www.whitebear.ch/mediaserver.htm

Arindelle

  • Citizen of the Universe
  • *****
  • Posts: 2772
Re: Definite Articles (non English ones)
« Reply #20 on: November 05, 2013, 09:23:07 am »

euh my two cents for French if not too late ... Le, La, Les ok maybe, but not Un, Une -- there are two many exceptions as un or une means both the number 1 and the article a.   And then what about "L" -- seems pretty complicated to me

This has never been on my "want-to-change" list (like more mouse functions in theater view  :P) because one usually associates articles with nouns eg. translate milk = le lait (ok lait (m.) in a dictionary but ....), assume this would be similar for all romance languages at least. Personally, I never would think of Boheme in a search - its LA Bohème.

Anyway thats just me after 25 years in country -  if at least there is an option to not use this or pick and choose ... why not.

Seems more for english speakers than for other language speakers IMHO. Que pensez-vous les franchouillards?  ;D


Logged

AndrewFG

  • MC Beta Team
  • Citizen of the Universe
  • *****
  • Posts: 3392
Re: Definite Articles (non English ones)
« Reply #21 on: November 05, 2013, 11:25:31 am »

One way to solve this is to use the non breaking space delimiter (Unicode 00A0) instead of the breaking space delimiter (Unicode 0020).
You could use 00A0 to separate the words in Die Hard, and 0020 to separate the words in Die Fledermaus...

So if you create an OnKeyPress handler on all of MC's edit boxes for sortable tags, then in case of the <space> key being pressed, if (say) the <control> key also down, then the handler could substitute the non breaking space character (Unicode 00A0) in the text instead of the regular space character (Unicode 0020).

EDIT:  I subsequently discovered that in Windows you can type a non breaking space by holding down ALT and typing 0160 (the character code in decimal) on the numeric keypad.

Logged
Author of Whitebear Digital Media Renderer Analyser - http://www.whitebear.ch/dmra.htm
Author of Whitebear - http://www.whitebear.ch/mediaserver.htm

JimH

  • Administrator
  • Citizen of the Universe
  • *****
  • Posts: 71333
  • Where did I put my teeth?
Re: Definite Articles (non English ones)
« Reply #22 on: December 17, 2013, 11:16:04 am »

Bump
Logged

john greenwood

  • World Citizen
  • ***
  • Posts: 108
Re: Definite Articles (non English ones)
« Reply #23 on: August 26, 2014, 05:16:14 pm »

MC has an option to exclude definite article prefixes ("the", "a") from the sort order on various lists -- but it applies to only English definite articles -- which is not very friendly for MC's many international users!!

I think the article list must include Der, Die, Das, Ein, Eine, Le, La, Les, Un, Une, El, Los, Las, Il (to cite just a few languages), plus also prefixes that aren't strictly definite articles like Sir ("Sir Colin Davis")

So I request that the Options setting should be extended to let the user edit their own list of definite articles to exclude from sorts; the easiest way would be to enter a (comma) separated list.



Could someone tell me where I can find this list and modify it?
Logged

Arindelle

  • Citizen of the Universe
  • *****
  • Posts: 2772
Re: Definite Articles (non English ones)
« Reply #24 on: August 26, 2014, 05:18:56 pm »

Could someone tell me where I can find this list and modify it?

sure. Options=>Tree & View=> Sorting
Logged

john greenwood

  • World Citizen
  • ***
  • Posts: 108
Re: Definite Articles (non English ones)
« Reply #25 on: August 26, 2014, 05:50:00 pm »

Thanks
Logged
Pages: [1]   Go Up