Compare two strings based on ASCII dictionary order

6 vues (au cours des 30 derniers jours)
Brandon Kuczenski
Brandon Kuczenski le 25 Mai 2012
I know the sort builtin function will sort cell arrays of strings in ascii dictionary order. But how may I simply compare two strings to determine which is first (by dictionary order) ? This must be a constituent part of the sort routine, but I cannot find a way to do it. (and sort is builtin so I cannot inspect it).

Réponse acceptée

Oleg Komarov
Oleg Komarov le 25 Mai 2012
[trash,idx] = sort({'abc';'a'})
Then just look at idx(1)
  2 commentaires
Geoff
Geoff le 25 Mai 2012
Oh, that's nice and easy =) I would use [~, idx] though, rather than create a variable called 'trash'.
Brandon Kuczenski
Brandon Kuczenski le 25 Mai 2012
This plus diff(idx) gives a usable result. it does not detect equal strings, however.

Connectez-vous pour commenter.

Plus de réponses (2)

Geoff
Geoff le 25 Mai 2012
Yeah this is a real failing of the strcmp function in my opinion. That function originated in C, and would tell you whether a string was less, equal or greater than another. But it was probably too non-intuitive for MatLab users because it returned zero if the strings were equal.
Basically, what the C function does is subtract the strings character by character. So you can do that here:
function cmp = cstrcmp( a, b )
% Force the strings to equal length
x = char({a;b});
% Subtract one from the other
d = x(1,:) - x(2,:);
% Remove zero entries
d(~d) = [];
if isempty(d)
cmp = 0;
else
cmp = d(1);
end
end
The output is:
a == b : 0
a > b : positive
a < b : negative
There are probably more efficient ways to do this in MatLab. I just stuck to the easy matrix operations.
  2 commentaires
Brandon Kuczenski
Brandon Kuczenski le 25 Mai 2012
char({a;b})- now that is useful. so is ~ instead of Trash; and I like d(~d) = [] as well.. thank you!
Geoff
Geoff le 25 Mai 2012
Haha yeah that logic negation of non-logic values is an old C habit. It's technically not very good programming practice (because it's not as readable as "d==0"), but MatLab does document the behaviour. Use at your own peril =)

Connectez-vous pour commenter.


Junaid
Junaid le 25 Mai 2012
Can you give one example. As I understand, you can do it by compare operator.
a = 'abd';
b = 'abc';
a <= b
output is [1 1 0] where 0 indicates that some character in b comes before in by dictionary order.
  2 commentaires
Brandon Kuczenski
Brandon Kuczenski le 25 Mai 2012
That only works if the strings are the same length. In order to do the comparison in general, I have to first test to see which string is longer and either truncate it or pad the other. I suppose that is not too much trouble, but I am surprised that it must be done manually.
Brandon Kuczenski
Brandon Kuczenski le 25 Mai 2012
Also, the single comparison is insufficient to determine which string comes first.
a='hellob'
b='hellbo'
According to a dictionary test, a>b. In order for me to know that, I would need to do both comparisons:
a>=b
>> a>=b
ans =
1 1 1 1 1 0
>> b>=a
ans =
1 1 1 1 0 1
>>
and then see which one has the earlier zero.

Connectez-vous pour commenter.

Catégories

En savoir plus sur Data Type Identification dans Help Center et File Exchange

Community Treasure Hunt

Find the treasures in MATLAB Central and discover how the community can help you!

Start Hunting!

Translated by