The average of multiple probability density functions
Hello,
I have a 1000×1 cell array of probability distribution objects (pds). They are all the same type of distribution, but they differ slightly in their parameter values.
Does anyone know how to get the average of these pds? I obtained them in an iterative process and would now like to know their mean or average.
I thought of taking the mean of the parameter values of the pds in a for loop, but I am struggling to find the right way to do it because I keep getting errors.
It would be helpful if someone could give me a hint :)
I have attached the cell array (Temp) in a .mat file.
Thanks in advance!
5 comments
Image Analyst
on 5 Dec 2019
J, it should work regardless of what toolboxes you have
s=load('temp.mat')
cellArray = s.Temp
s =
struct with fields:
Temp: {1000×1 cell}
cellArray =
1000×1 cell array
{1×1 prob.GeneralizedParetoDistribution}
{1×1 prob.GeneralizedParetoDistribution}
{1×1 prob.GeneralizedParetoDistribution}
{1×1 prob.GeneralizedParetoDistribution}
etc.
Jeff Miller
on 5 Dec 2019
What do you mean by "the average of the probability distributions"?
For example, suppose one distribution is normal(0,1) and another distribution is normal(10,2). Is the average normal(5,1.5)? Or is the average bimodal with one mode around 0 and another around 10? Or is the average distribution something else?
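To make the two readings in this example concrete, here is a minimal sketch that builds both candidates. It assumes the second parameter of normal(10,2) is the standard deviation, and the variable names are only illustrative.

```matlab
% Two candidate "averages" of normal(0,1) and normal(10,2).
% Assumption: the second parameter is the standard deviation.
pdA = makedist('Normal', 'mu', 0,  'sigma', 1);
pdB = makedist('Normal', 'mu', 10, 'sigma', 2);

x = linspace(-5, 18, 500);

% Reading 1: average the parameters, giving a single normal(5,1.5)
pdParamAvg = makedist('Normal', ...
    'mu',    (pdA.mu + pdB.mu) / 2, ...
    'sigma', (pdA.sigma + pdB.sigma) / 2);
fParamAvg = pdf(pdParamAvg, x);

% Reading 2: average the density curves pointwise, giving an
% equal-weight mixture with one mode near 0 and another near 10
fMixture = (pdf(pdA, x) + pdf(pdB, x)) / 2;

plot(x, fParamAvg, x, fMixture);
legend('parameter average', 'equal-weight mixture');
```

Both curves are valid densities, but they describe very different things, so which one is "the average" depends on what the result will be used for.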
Accepted Answer
Ridwan Alam
on 5 Dec 2019
The idea of "average" is ill-defined for pdfs.
In your case, all 1000 distributions are generalized Paretos. Your theta is fixed in all pdfs, so you can certainly take the average of the sigmas and the ks. But what meaning/significance those "averages" carry is something you need to decide.
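A minimal sketch of averaging the parameters without an explicit for loop could look like this. The three-element Temp below is a hypothetical stand-in for the attached cell array, with made-up parameter values; with the real data you would load temp.mat instead.

```matlab
% Hypothetical stand-in for the attached Temp (made-up parameter values)
Temp = { makedist('GeneralizedPareto', 'k', 0.1, 'sigma', 1.0, 'theta', 2); ...
         makedist('GeneralizedPareto', 'k', 0.2, 'sigma', 1.2, 'theta', 2); ...
         makedist('GeneralizedPareto', 'k', 0.3, 'sigma', 1.4, 'theta', 2) };

% Pull the shape and scale parameter out of every distribution object
kAll     = cellfun(@(pd) pd.k,     Temp);
sigmaAll = cellfun(@(pd) pd.sigma, Temp);

% Average them; theta is taken from the first object because it is fixed
avgK     = mean(kAll);
avgSigma = mean(sigmaAll);
avgTheta = Temp{1}.theta;

% One generalized Pareto built from the averaged parameters
avgPd = makedist('GeneralizedPareto', ...
    'k', avgK, 'sigma', avgSigma, 'theta', avgTheta);
```

cellfun returns a plain numeric vector here because each call yields a scalar, which sidesteps the indexing errors that are easy to hit when looping over a cell array of objects.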
If you are looking for the average expected value, you can calculate the mean (expected value) of each Pareto distribution and then average those: mean = theta + sigma/(1-k), which is finite only for k < 1. Ref: https://en.wikipedia.org/wiki/Generalized_Pareto_distribution
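As a rough sketch of that calculation, the snippet below applies the formula to every distribution and averages the results. The small Temp is again a hypothetical stand-in with made-up values, and skipping entries with k >= 1 is my own assumption about how to handle distributions whose mean is not finite.

```matlab
% Hypothetical stand-in for the attached Temp (made-up parameter values)
Temp = { makedist('GeneralizedPareto', 'k', 0.1, 'sigma', 1.0, 'theta', 2); ...
         makedist('GeneralizedPareto', 'k', 0.2, 'sigma', 1.2, 'theta', 2); ...
         makedist('GeneralizedPareto', 'k', 0.3, 'sigma', 1.4, 'theta', 2) };

k     = cellfun(@(pd) pd.k,     Temp);
sigma = cellfun(@(pd) pd.sigma, Temp);
theta = cellfun(@(pd) pd.theta, Temp);

% Expected value of each generalized Pareto: theta + sigma/(1-k)
meansFormula = theta + sigma ./ (1 - k);

% Assumption: only distributions with k < 1 have a finite mean, so
% the others are left out of the average
hasFiniteMean = k < 1;
avgMean = mean(meansFormula(hasFiniteMean));

% Cross-check against the mean method of the distribution objects
meansBuiltin = cellfun(@(pd) mean(pd), Temp);
maxDiff = max(abs(meansFormula(hasFiniteMean) - meansBuiltin(hasFiniteMean)));
```

maxDiff should be close to zero if the formula and the objects' own mean method agree.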
Also, if you are looking for the overall distribution parameters, you can combine the input data [available in Temp{}.InputData.data] and refit with gpfit().
Hope this helps.
2 comments
Ridwan Alam
on 6 Dec 2019
Edited: Ridwan Alam
on 6 Dec 2019
Rima, as I mentioned, one approach could be to stack all the input data from all the pdfs.
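% collect the input data of every fitted distribution (column vectors)
inputData = cellfun(@(pd) pd.InputData.data, Temp, 'UniformOutput', false);
% stack them into one long column vector
allInputData = vertcat(inputData{:});
% in your case, theta is fixed
newTheta = Temp{1}.theta;
% gpfit fits the two-parameter form, so subtract theta first
paramhat = gpfit(allInputData - newTheta);
newK = paramhat(1);      % shape parameter
newSigma = paramhat(2);  % scale parameter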
Hope this helps.
Please accept the response as an answer, and vote up if you liked the conversation.