Difference between “probability density function” and “probability distribution function”? The 2019 Stack Overflow Developer Survey Results Are InDifference between density and distribution [in formal mathematical terms]Probability distribution vs. probability mass function (PMF): what is the difference between the terms?Distinguish Normal Distribution, Gaussian Distribution and Normalised Gaussian Distribution?Confusion between probability distribution function and probability density functionProbability density/mass functionProbability mass function and Probability density functionDifference between Probability and Probability Densityprobability density function and cumulative distribution function.Probability measure, probability density function or probability event ? Are they different?Difference Between joint probability distribution and conditional probability distribution?What is the difference between a probability mass function and discrete probability distribution?What is the difference between a joint distribution function and the joint density function.Probability distribution vs. probability mass function / Probability density function terms: what's the difference

Can withdrawing asylum be illegal?

Is it ok to offer lower paid work as a trial period before negotiating for a full-time job?

Relationship between Gromov-Witten and Taubes' Gromov invariant

How to support a colleague who finds meetings extremely tiring?

What do I do when my TA workload is more than expected?

The difference between dialogue marks

What could be the right powersource for 15 seconds lifespan disposable giant chainsaw?

What to do when moving next to a bird sanctuary with a loosely-domesticated cat?

What is the meaning of Triage in Cybersec world?

How do you keep chess fun when your opponent constantly beats you?

A word that means fill it to the required quantity

How to translate "being like"?

Why are there uneven bright areas in this photo of black hole?

Why couldn't they take pictures of a closer black hole?

Why doesn't UInt have a toDouble()?

How to charge AirPods to keep battery healthy?

Is it ethical to upload a automatically generated paper to a non peer-reviewed site as part of a larger research?

Why can't devices on different VLANs, but on the same subnet, communicate?

Is Cinnamon a desktop environment or a window manager? (Or both?)

Is it okay to consider publishing in my first year of PhD?

Will it cause any balance problems to have PCs level up and gain the benefits of a long rest mid-fight?

How can I add encounters in the Lost Mine of Phandelver campaign without giving PCs too much XP?

What do hard-Brexiteers want with respect to the Irish border?

How can I define good in a religion that claims no moral authority?



Difference between “probability density function” and “probability distribution function”?



The 2019 Stack Overflow Developer Survey Results Are InDifference between density and distribution [in formal mathematical terms]Probability distribution vs. probability mass function (PMF): what is the difference between the terms?Distinguish Normal Distribution, Gaussian Distribution and Normalised Gaussian Distribution?Confusion between probability distribution function and probability density functionProbability density/mass functionProbability mass function and Probability density functionDifference between Probability and Probability Densityprobability density function and cumulative distribution function.Probability measure, probability density function or probability event ? Are they different?Difference Between joint probability distribution and conditional probability distribution?What is the difference between a probability mass function and discrete probability distribution?What is the difference between a joint distribution function and the joint density function.Probability distribution vs. probability mass function / Probability density function terms: what's the difference










31












$begingroup$


I am studying for my statistics exam, and have to know a lot of theory. My question is:



Whats the difference between probability density function and probability distribution function?










share|cite|improve this question











$endgroup$







  • 2




    $begingroup$
    The density (when it exists) is the derivative of the distribution function.
    $endgroup$
    – Joel Cohen
    Jul 27 '12 at 13:31






  • 1




    $begingroup$
    You mean, "Difference between Probability density function and cumulative distribution function?"?
    $endgroup$
    – Matt O'Brien
    Feb 5 '14 at 21:08















31












$begingroup$


I am studying for my statistics exam, and have to know a lot of theory. My question is:



Whats the difference between probability density function and probability distribution function?










share|cite|improve this question











$endgroup$







  • 2




    $begingroup$
    The density (when it exists) is the derivative of the distribution function.
    $endgroup$
    – Joel Cohen
    Jul 27 '12 at 13:31






  • 1




    $begingroup$
    You mean, "Difference between Probability density function and cumulative distribution function?"?
    $endgroup$
    – Matt O'Brien
    Feb 5 '14 at 21:08













31












31








31


20



$begingroup$


I am studying for my statistics exam, and have to know a lot of theory. My question is:



Whats the difference between probability density function and probability distribution function?










share|cite|improve this question











$endgroup$




I am studying for my statistics exam, and have to know a lot of theory. My question is:



Whats the difference between probability density function and probability distribution function?







probability probability-distributions terminology






share|cite|improve this question















share|cite|improve this question













share|cite|improve this question




share|cite|improve this question








edited Oct 14 '15 at 4:56







user147263

















asked Jul 27 '12 at 13:28









Le ChifreLe Chifre

5923821




5923821







  • 2




    $begingroup$
    The density (when it exists) is the derivative of the distribution function.
    $endgroup$
    – Joel Cohen
    Jul 27 '12 at 13:31






  • 1




    $begingroup$
    You mean, "Difference between Probability density function and cumulative distribution function?"?
    $endgroup$
    – Matt O'Brien
    Feb 5 '14 at 21:08












  • 2




    $begingroup$
    The density (when it exists) is the derivative of the distribution function.
    $endgroup$
    – Joel Cohen
    Jul 27 '12 at 13:31






  • 1




    $begingroup$
    You mean, "Difference between Probability density function and cumulative distribution function?"?
    $endgroup$
    – Matt O'Brien
    Feb 5 '14 at 21:08







2




2




$begingroup$
The density (when it exists) is the derivative of the distribution function.
$endgroup$
– Joel Cohen
Jul 27 '12 at 13:31




$begingroup$
The density (when it exists) is the derivative of the distribution function.
$endgroup$
– Joel Cohen
Jul 27 '12 at 13:31




1




1




$begingroup$
You mean, "Difference between Probability density function and cumulative distribution function?"?
$endgroup$
– Matt O'Brien
Feb 5 '14 at 21:08




$begingroup$
You mean, "Difference between Probability density function and cumulative distribution function?"?
$endgroup$
– Matt O'Brien
Feb 5 '14 at 21:08










3 Answers
3






active

oldest

votes


















10












$begingroup$

The relation between the probability density funtion $f$ and the cumulative distribution function $F$ is
$$
F(k) = sum_i le k f(i)
$$
if $f$ is discrete and
$$
F(x) = int_y le x f(y),dy
$$
if $f$ is continuous.






share|cite|improve this answer









$endgroup$












  • $begingroup$
    what is meant by discrete and continuous?
    $endgroup$
    – Le Chifre
    Jul 27 '12 at 13:40










  • $begingroup$
    @maximus if the variable ranges over a discrete or continuous set of values. So if you're rolling a die, you have $1,2,3,4,5,6$, which is discrete. If you're picking a random point on a line, then your set is, say, the interval $[0,L]$ which is continuous.
    $endgroup$
    – Robert Mastragostino
    Jul 27 '12 at 13:45










  • $begingroup$
    @maximus For example, when flipping a coin or rolling a dice the outcome is discrete whereas measuring the time until the bus arrives at a bus stop is continuous.
    $endgroup$
    – August Karlstrom
    Jul 27 '12 at 13:47










  • $begingroup$
    so discrete is when you can count it! and continuous is when there is much more probability in it? Is this description right or wrong? Pls correct me!
    $endgroup$
    – Le Chifre
    Jul 27 '12 at 13:50










  • $begingroup$
    @maximus That's correct though you may have to count forever. Check out the concept of a countable set for an exact definition.
    $endgroup$
    – August Karlstrom
    Jul 27 '12 at 14:12



















32












$begingroup$

Distribution Function



  1. The probability distribution function / probability function has ambiguous definition. They may be referred to:

    • Probability density function (PDF)

    • Cumulative distribution function (CDF)

    • or probability mass function (PMF) (statement from Wikipedia)


  2. But what confirm is:

    • Discrete case: Probability Mass Function (PMF)

    • Continuous case: Probability Density Function (PDF)

    • Both cases: Cumulative distribution function (CDF)


  3. Probability at certain $x$ value, $P(X = x)$ can be directly obtained in:

    • PMF for discrete case

    • PDF for continuous case


  4. Probability for values less than $x$, $P(X < x)$ or Probability for values within a range from $a$ to $b$, $P(a < X < b)$ can be directly obtained in:

    • CDF for both discrete / continuous case


  5. Distribution function is referred to CDF or Cumulative Frequency Function (see this)

In terms of Acquisition and Plot Generation Method



  1. Collected data appear as discrete when:

    • The measurement of a subject is naturally discrete type, such as numbers resulted from dice rolled, count of people.

    • The measurement is digitized machine data, which has no intermediate values between quantized levels due to sampling process.

    • In later case, when resolution higher, the measurement is closer to analog/continuous signal/variable.


  2. Way of generate a PMF from discrete data:

    • Plot a histogram of the data for all the $x$'s, the $y$-axis is the frequency or quantity at every $x$.

    • Scale the $y$-axis by dividing with total number of data collected (data size) $longrightarrow$ and this is called PMF.


  3. Way of generate a PDF from discrete / continuous data:

    • Find a continuous equation that models the collected data, let say normal distribution equation.

    • Calculate the parameters required in the equation from the collected data. For example, parameters for normal distribution equation are mean and standard deviation. Calculate them from collected data.

    • Based on the parameters, plot the equation with continuous $x$-value $longrightarrow$ that is called PDF.


  4. How to generate a CDF:

    • In discrete case, CDF accumulates the $y$ values in PMF at each discrete $x$ and less than $x$. Repeat this for every $x$. The final plot is a monotonically increasing until $1$ in the last $x$ $longrightarrow$ this is called discrete CDF.

    • In continuous case, integrate PDF over $x$; the result is a continuous CDF.


Why PMF, PDF and CDF?



  1. PMF is preferred when

    • Probability at every $x$ value is interest of study. This makes sense when studying a discrete data - such as we interest to probability of getting certain number from a dice roll.


  2. PDF is preferred when

    • We wish to model a collected data with a continuous function, by using few parameters such as mean to speculate the population distribution.


  3. CDF is preferred when

    • Cumulative probability in a range is point of interest.

    • Especially in the case of continuous data, CDF much makes sense than PDF - e.g., probability of students' height less than $170$ cm (CDF) is much informative than the probability at exact $170$ cm (PDF).






share|cite|improve this answer











$endgroup$




















    0












    $begingroup$

    Some abuse of language exists in these terms, which can vary. Below is a common usage.



    In the continuous case (density):



    (continuous) probability distribution function = probability density function = density function
    (continuous) probability distribution = density


    In the discrete case (mass/distribution):



    (discrete) probability distribution function = probability mass function
    (discrete) probability distribution = distribution


    Oddly enough, you may never see a probability mass function called a mass function or a distribution function, nor may you see a discrete probability distribution called a mass. I am sure there is some historical reason as to why. As they say, das war schon immer so und wird auch immer so bleiben.






    share|cite|improve this answer









    $endgroup$











      protected by Zev Chonoles Aug 23 '16 at 9:24



      Thank you for your interest in this question.
      Because it has attracted low-quality or spam answers that had to be removed, posting an answer now requires 10 reputation on this site (the association bonus does not count).



      Would you like to answer one of these unanswered questions instead?














      3 Answers
      3






      active

      oldest

      votes








      3 Answers
      3






      active

      oldest

      votes









      active

      oldest

      votes






      active

      oldest

      votes









      10












      $begingroup$

      The relation between the probability density funtion $f$ and the cumulative distribution function $F$ is
      $$
      F(k) = sum_i le k f(i)
      $$
      if $f$ is discrete and
      $$
      F(x) = int_y le x f(y),dy
      $$
      if $f$ is continuous.






      share|cite|improve this answer









      $endgroup$












      • $begingroup$
        what is meant by discrete and continuous?
        $endgroup$
        – Le Chifre
        Jul 27 '12 at 13:40










      • $begingroup$
        @maximus if the variable ranges over a discrete or continuous set of values. So if you're rolling a die, you have $1,2,3,4,5,6$, which is discrete. If you're picking a random point on a line, then your set is, say, the interval $[0,L]$ which is continuous.
        $endgroup$
        – Robert Mastragostino
        Jul 27 '12 at 13:45










      • $begingroup$
        @maximus For example, when flipping a coin or rolling a dice the outcome is discrete whereas measuring the time until the bus arrives at a bus stop is continuous.
        $endgroup$
        – August Karlstrom
        Jul 27 '12 at 13:47










      • $begingroup$
        so discrete is when you can count it! and continuous is when there is much more probability in it? Is this description right or wrong? Pls correct me!
        $endgroup$
        – Le Chifre
        Jul 27 '12 at 13:50










      • $begingroup$
        @maximus That's correct though you may have to count forever. Check out the concept of a countable set for an exact definition.
        $endgroup$
        – August Karlstrom
        Jul 27 '12 at 14:12
















      10












      $begingroup$

      The relation between the probability density funtion $f$ and the cumulative distribution function $F$ is
      $$
      F(k) = sum_i le k f(i)
      $$
      if $f$ is discrete and
      $$
      F(x) = int_y le x f(y),dy
      $$
      if $f$ is continuous.






      share|cite|improve this answer









      $endgroup$












      • $begingroup$
        what is meant by discrete and continuous?
        $endgroup$
        – Le Chifre
        Jul 27 '12 at 13:40










      • $begingroup$
        @maximus if the variable ranges over a discrete or continuous set of values. So if you're rolling a die, you have $1,2,3,4,5,6$, which is discrete. If you're picking a random point on a line, then your set is, say, the interval $[0,L]$ which is continuous.
        $endgroup$
        – Robert Mastragostino
        Jul 27 '12 at 13:45










      • $begingroup$
        @maximus For example, when flipping a coin or rolling a dice the outcome is discrete whereas measuring the time until the bus arrives at a bus stop is continuous.
        $endgroup$
        – August Karlstrom
        Jul 27 '12 at 13:47










      • $begingroup$
        so discrete is when you can count it! and continuous is when there is much more probability in it? Is this description right or wrong? Pls correct me!
        $endgroup$
        – Le Chifre
        Jul 27 '12 at 13:50










      • $begingroup$
        @maximus That's correct though you may have to count forever. Check out the concept of a countable set for an exact definition.
        $endgroup$
        – August Karlstrom
        Jul 27 '12 at 14:12














      10












      10








      10





      $begingroup$

      The relation between the probability density funtion $f$ and the cumulative distribution function $F$ is
      $$
      F(k) = sum_i le k f(i)
      $$
      if $f$ is discrete and
      $$
      F(x) = int_y le x f(y),dy
      $$
      if $f$ is continuous.






      share|cite|improve this answer









      $endgroup$



      The relation between the probability density funtion $f$ and the cumulative distribution function $F$ is
      $$
      F(k) = sum_i le k f(i)
      $$
      if $f$ is discrete and
      $$
      F(x) = int_y le x f(y),dy
      $$
      if $f$ is continuous.







      share|cite|improve this answer












      share|cite|improve this answer



      share|cite|improve this answer










      answered Jul 27 '12 at 13:39









      August KarlstromAugust Karlstrom

      26726




      26726











      • $begingroup$
        what is meant by discrete and continuous?
        $endgroup$
        – Le Chifre
        Jul 27 '12 at 13:40










      • $begingroup$
        @maximus if the variable ranges over a discrete or continuous set of values. So if you're rolling a die, you have $1,2,3,4,5,6$, which is discrete. If you're picking a random point on a line, then your set is, say, the interval $[0,L]$ which is continuous.
        $endgroup$
        – Robert Mastragostino
        Jul 27 '12 at 13:45










      • $begingroup$
        @maximus For example, when flipping a coin or rolling a dice the outcome is discrete whereas measuring the time until the bus arrives at a bus stop is continuous.
        $endgroup$
        – August Karlstrom
        Jul 27 '12 at 13:47










      • $begingroup$
        so discrete is when you can count it! and continuous is when there is much more probability in it? Is this description right or wrong? Pls correct me!
        $endgroup$
        – Le Chifre
        Jul 27 '12 at 13:50










      • $begingroup$
        @maximus That's correct though you may have to count forever. Check out the concept of a countable set for an exact definition.
        $endgroup$
        – August Karlstrom
        Jul 27 '12 at 14:12

















      • $begingroup$
        what is meant by discrete and continuous?
        $endgroup$
        – Le Chifre
        Jul 27 '12 at 13:40










      • $begingroup$
        @maximus if the variable ranges over a discrete or continuous set of values. So if you're rolling a die, you have $1,2,3,4,5,6$, which is discrete. If you're picking a random point on a line, then your set is, say, the interval $[0,L]$ which is continuous.
        $endgroup$
        – Robert Mastragostino
        Jul 27 '12 at 13:45










      • $begingroup$
        @maximus For example, when flipping a coin or rolling a dice the outcome is discrete whereas measuring the time until the bus arrives at a bus stop is continuous.
        $endgroup$
        – August Karlstrom
        Jul 27 '12 at 13:47










      • $begingroup$
        so discrete is when you can count it! and continuous is when there is much more probability in it? Is this description right or wrong? Pls correct me!
        $endgroup$
        – Le Chifre
        Jul 27 '12 at 13:50










      • $begingroup$
        @maximus That's correct though you may have to count forever. Check out the concept of a countable set for an exact definition.
        $endgroup$
        – August Karlstrom
        Jul 27 '12 at 14:12
















      $begingroup$
      what is meant by discrete and continuous?
      $endgroup$
      – Le Chifre
      Jul 27 '12 at 13:40




      $begingroup$
      what is meant by discrete and continuous?
      $endgroup$
      – Le Chifre
      Jul 27 '12 at 13:40












      $begingroup$
      @maximus if the variable ranges over a discrete or continuous set of values. So if you're rolling a die, you have $1,2,3,4,5,6$, which is discrete. If you're picking a random point on a line, then your set is, say, the interval $[0,L]$ which is continuous.
      $endgroup$
      – Robert Mastragostino
      Jul 27 '12 at 13:45




      $begingroup$
      @maximus if the variable ranges over a discrete or continuous set of values. So if you're rolling a die, you have $1,2,3,4,5,6$, which is discrete. If you're picking a random point on a line, then your set is, say, the interval $[0,L]$ which is continuous.
      $endgroup$
      – Robert Mastragostino
      Jul 27 '12 at 13:45












      $begingroup$
      @maximus For example, when flipping a coin or rolling a dice the outcome is discrete whereas measuring the time until the bus arrives at a bus stop is continuous.
      $endgroup$
      – August Karlstrom
      Jul 27 '12 at 13:47




      $begingroup$
      @maximus For example, when flipping a coin or rolling a dice the outcome is discrete whereas measuring the time until the bus arrives at a bus stop is continuous.
      $endgroup$
      – August Karlstrom
      Jul 27 '12 at 13:47












      $begingroup$
      so discrete is when you can count it! and continuous is when there is much more probability in it? Is this description right or wrong? Pls correct me!
      $endgroup$
      – Le Chifre
      Jul 27 '12 at 13:50




      $begingroup$
      so discrete is when you can count it! and continuous is when there is much more probability in it? Is this description right or wrong? Pls correct me!
      $endgroup$
      – Le Chifre
      Jul 27 '12 at 13:50












      $begingroup$
      @maximus That's correct though you may have to count forever. Check out the concept of a countable set for an exact definition.
      $endgroup$
      – August Karlstrom
      Jul 27 '12 at 14:12





      $begingroup$
      @maximus That's correct though you may have to count forever. Check out the concept of a countable set for an exact definition.
      $endgroup$
      – August Karlstrom
      Jul 27 '12 at 14:12












      32












      $begingroup$

      Distribution Function



      1. The probability distribution function / probability function has ambiguous definition. They may be referred to:

        • Probability density function (PDF)

        • Cumulative distribution function (CDF)

        • or probability mass function (PMF) (statement from Wikipedia)


      2. But what confirm is:

        • Discrete case: Probability Mass Function (PMF)

        • Continuous case: Probability Density Function (PDF)

        • Both cases: Cumulative distribution function (CDF)


      3. Probability at certain $x$ value, $P(X = x)$ can be directly obtained in:

        • PMF for discrete case

        • PDF for continuous case


      4. Probability for values less than $x$, $P(X < x)$ or Probability for values within a range from $a$ to $b$, $P(a < X < b)$ can be directly obtained in:

        • CDF for both discrete / continuous case


      5. Distribution function is referred to CDF or Cumulative Frequency Function (see this)

      In terms of Acquisition and Plot Generation Method



      1. Collected data appear as discrete when:

        • The measurement of a subject is naturally discrete type, such as numbers resulted from dice rolled, count of people.

        • The measurement is digitized machine data, which has no intermediate values between quantized levels due to sampling process.

        • In later case, when resolution higher, the measurement is closer to analog/continuous signal/variable.


      2. Way of generate a PMF from discrete data:

        • Plot a histogram of the data for all the $x$'s, the $y$-axis is the frequency or quantity at every $x$.

        • Scale the $y$-axis by dividing with total number of data collected (data size) $longrightarrow$ and this is called PMF.


      3. Way of generate a PDF from discrete / continuous data:

        • Find a continuous equation that models the collected data, let say normal distribution equation.

        • Calculate the parameters required in the equation from the collected data. For example, parameters for normal distribution equation are mean and standard deviation. Calculate them from collected data.

        • Based on the parameters, plot the equation with continuous $x$-value $longrightarrow$ that is called PDF.


      4. How to generate a CDF:

        • In discrete case, CDF accumulates the $y$ values in PMF at each discrete $x$ and less than $x$. Repeat this for every $x$. The final plot is a monotonically increasing until $1$ in the last $x$ $longrightarrow$ this is called discrete CDF.

        • In continuous case, integrate PDF over $x$; the result is a continuous CDF.


      Why PMF, PDF and CDF?



      1. PMF is preferred when

        • Probability at every $x$ value is interest of study. This makes sense when studying a discrete data - such as we interest to probability of getting certain number from a dice roll.


      2. PDF is preferred when

        • We wish to model a collected data with a continuous function, by using few parameters such as mean to speculate the population distribution.


      3. CDF is preferred when

        • Cumulative probability in a range is point of interest.

        • Especially in the case of continuous data, CDF much makes sense than PDF - e.g., probability of students' height less than $170$ cm (CDF) is much informative than the probability at exact $170$ cm (PDF).






      share|cite|improve this answer











      $endgroup$

















        32












        $begingroup$

        Distribution Function



        1. The probability distribution function / probability function has ambiguous definition. They may be referred to:

          • Probability density function (PDF)

          • Cumulative distribution function (CDF)

          • or probability mass function (PMF) (statement from Wikipedia)


        2. But what confirm is:

          • Discrete case: Probability Mass Function (PMF)

          • Continuous case: Probability Density Function (PDF)

          • Both cases: Cumulative distribution function (CDF)


        3. Probability at certain $x$ value, $P(X = x)$ can be directly obtained in:

          • PMF for discrete case

          • PDF for continuous case


        4. Probability for values less than $x$, $P(X < x)$ or Probability for values within a range from $a$ to $b$, $P(a < X < b)$ can be directly obtained in:

          • CDF for both discrete / continuous case


        5. Distribution function is referred to CDF or Cumulative Frequency Function (see this)

        In terms of Acquisition and Plot Generation Method



        1. Collected data appear as discrete when:

          • The measurement of a subject is naturally discrete type, such as numbers resulted from dice rolled, count of people.

          • The measurement is digitized machine data, which has no intermediate values between quantized levels due to sampling process.

          • In later case, when resolution higher, the measurement is closer to analog/continuous signal/variable.


        2. Way of generate a PMF from discrete data:

          • Plot a histogram of the data for all the $x$'s, the $y$-axis is the frequency or quantity at every $x$.

          • Scale the $y$-axis by dividing with total number of data collected (data size) $longrightarrow$ and this is called PMF.


        3. Way of generate a PDF from discrete / continuous data:

          • Find a continuous equation that models the collected data, let say normal distribution equation.

          • Calculate the parameters required in the equation from the collected data. For example, parameters for normal distribution equation are mean and standard deviation. Calculate them from collected data.

          • Based on the parameters, plot the equation with continuous $x$-value $longrightarrow$ that is called PDF.


        4. How to generate a CDF:

          • In discrete case, CDF accumulates the $y$ values in PMF at each discrete $x$ and less than $x$. Repeat this for every $x$. The final plot is a monotonically increasing until $1$ in the last $x$ $longrightarrow$ this is called discrete CDF.

          • In continuous case, integrate PDF over $x$; the result is a continuous CDF.


        Why PMF, PDF and CDF?



        1. PMF is preferred when

          • Probability at every $x$ value is interest of study. This makes sense when studying a discrete data - such as we interest to probability of getting certain number from a dice roll.


        2. PDF is preferred when

          • We wish to model a collected data with a continuous function, by using few parameters such as mean to speculate the population distribution.


        3. CDF is preferred when

          • Cumulative probability in a range is point of interest.

          • Especially in the case of continuous data, CDF much makes sense than PDF - e.g., probability of students' height less than $170$ cm (CDF) is much informative than the probability at exact $170$ cm (PDF).






        share|cite|improve this answer











        $endgroup$















          32












          32








          32





          $begingroup$

          Distribution Function



          1. The probability distribution function / probability function has ambiguous definition. They may be referred to:

            • Probability density function (PDF)

            • Cumulative distribution function (CDF)

            • or probability mass function (PMF) (statement from Wikipedia)


          2. But what confirm is:

            • Discrete case: Probability Mass Function (PMF)

            • Continuous case: Probability Density Function (PDF)

            • Both cases: Cumulative distribution function (CDF)


          3. Probability at certain $x$ value, $P(X = x)$ can be directly obtained in:

            • PMF for discrete case

            • PDF for continuous case


          4. Probability for values less than $x$, $P(X < x)$ or Probability for values within a range from $a$ to $b$, $P(a < X < b)$ can be directly obtained in:

            • CDF for both discrete / continuous case


          5. Distribution function is referred to CDF or Cumulative Frequency Function (see this)

          In terms of Acquisition and Plot Generation Method



          1. Collected data appear as discrete when:

            • The measurement of a subject is naturally discrete type, such as numbers resulted from dice rolled, count of people.

            • The measurement is digitized machine data, which has no intermediate values between quantized levels due to sampling process.

            • In later case, when resolution higher, the measurement is closer to analog/continuous signal/variable.


          2. Way of generate a PMF from discrete data:

            • Plot a histogram of the data for all the $x$'s, the $y$-axis is the frequency or quantity at every $x$.

            • Scale the $y$-axis by dividing with total number of data collected (data size) $longrightarrow$ and this is called PMF.


          3. Way of generate a PDF from discrete / continuous data:

            • Find a continuous equation that models the collected data, let say normal distribution equation.

            • Calculate the parameters required in the equation from the collected data. For example, parameters for normal distribution equation are mean and standard deviation. Calculate them from collected data.

            • Based on the parameters, plot the equation with continuous $x$-value $longrightarrow$ that is called PDF.


          4. How to generate a CDF:

            • In discrete case, CDF accumulates the $y$ values in PMF at each discrete $x$ and less than $x$. Repeat this for every $x$. The final plot is a monotonically increasing until $1$ in the last $x$ $longrightarrow$ this is called discrete CDF.

            • In continuous case, integrate PDF over $x$; the result is a continuous CDF.


          Why PMF, PDF and CDF?



          1. PMF is preferred when

            • Probability at every $x$ value is interest of study. This makes sense when studying a discrete data - such as we interest to probability of getting certain number from a dice roll.


          2. PDF is preferred when

            • We wish to model a collected data with a continuous function, by using few parameters such as mean to speculate the population distribution.


          3. CDF is preferred when

            • Cumulative probability in a range is point of interest.

            • Especially in the case of continuous data, CDF much makes sense than PDF - e.g., probability of students' height less than $170$ cm (CDF) is much informative than the probability at exact $170$ cm (PDF).






          share|cite|improve this answer











          $endgroup$



          Distribution Function



          1. The probability distribution function / probability function has ambiguous definition. They may be referred to:

            • Probability density function (PDF)

            • Cumulative distribution function (CDF)

            • or probability mass function (PMF) (statement from Wikipedia)


          2. But what confirm is:

            • Discrete case: Probability Mass Function (PMF)

            • Continuous case: Probability Density Function (PDF)

            • Both cases: Cumulative distribution function (CDF)


          3. Probability at certain $x$ value, $P(X = x)$ can be directly obtained in:

            • PMF for discrete case

            • PDF for continuous case


          4. Probability for values less than $x$, $P(X < x)$ or Probability for values within a range from $a$ to $b$, $P(a < X < b)$ can be directly obtained in:

            • CDF for both discrete / continuous case


          5. Distribution function is referred to CDF or Cumulative Frequency Function (see this)

          In terms of Acquisition and Plot Generation Method



          1. Collected data appear as discrete when:

            • The measurement of a subject is naturally discrete type, such as numbers resulted from dice rolled, count of people.

            • The measurement is digitized machine data, which has no intermediate values between quantized levels due to sampling process.

            • In later case, when resolution higher, the measurement is closer to analog/continuous signal/variable.


          2. Way of generate a PMF from discrete data:

            • Plot a histogram of the data for all the $x$'s, the $y$-axis is the frequency or quantity at every $x$.

            • Scale the $y$-axis by dividing with total number of data collected (data size) $longrightarrow$ and this is called PMF.


          3. Way of generate a PDF from discrete / continuous data:

            • Find a continuous equation that models the collected data, let say normal distribution equation.

            • Calculate the parameters required in the equation from the collected data. For example, parameters for normal distribution equation are mean and standard deviation. Calculate them from collected data.

            • Based on the parameters, plot the equation with continuous $x$-value $longrightarrow$ that is called PDF.


          4. How to generate a CDF:

            • In discrete case, CDF accumulates the $y$ values in PMF at each discrete $x$ and less than $x$. Repeat this for every $x$. The final plot is a monotonically increasing until $1$ in the last $x$ $longrightarrow$ this is called discrete CDF.

            • In continuous case, integrate PDF over $x$; the result is a continuous CDF.


          Why PMF, PDF and CDF?



          1. PMF is preferred when

            • Probability at every $x$ value is interest of study. This makes sense when studying a discrete data - such as we interest to probability of getting certain number from a dice roll.


          2. PDF is preferred when

            • We wish to model a collected data with a continuous function, by using few parameters such as mean to speculate the population distribution.


          3. CDF is preferred when

            • Cumulative probability in a range is point of interest.

            • Especially in the case of continuous data, CDF much makes sense than PDF - e.g., probability of students' height less than $170$ cm (CDF) is much informative than the probability at exact $170$ cm (PDF).







          share|cite|improve this answer














          share|cite|improve this answer



          share|cite|improve this answer








          edited Mar 24 at 2:42









          Rócherz

          3,0263823




          3,0263823










          answered Mar 3 '14 at 10:14









          user132704user132704

          321133




          321133





















              0












              $begingroup$

              Some abuse of language exists in these terms, which can vary. Below is a common usage.



              In the continuous case (density):



              (continuous) probability distribution function = probability density function = density function
              (continuous) probability distribution = density


              In the discrete case (mass/distribution):



              (discrete) probability distribution function = probability mass function
              (discrete) probability distribution = distribution


              Oddly enough, you may never see a probability mass function called a mass function or a distribution function, nor may you see a discrete probability distribution called a mass. I am sure there is some historical reason as to why. As they say, das war schon immer so und wird auch immer so bleiben.






              share|cite|improve this answer









              $endgroup$

















                0












                $begingroup$

                Some abuse of language exists in these terms, which can vary. Below is a common usage.



                In the continuous case (density):



                (continuous) probability distribution function = probability density function = density function
                (continuous) probability distribution = density


                In the discrete case (mass/distribution):



                (discrete) probability distribution function = probability mass function
                (discrete) probability distribution = distribution


                Oddly enough, you may never see a probability mass function called a mass function or a distribution function, nor may you see a discrete probability distribution called a mass. I am sure there is some historical reason as to why. As they say, das war schon immer so und wird auch immer so bleiben.






                share|cite|improve this answer









                $endgroup$















                  0












                  0








                  0





                  $begingroup$

                  Some abuse of language exists in these terms, which can vary. Below is a common usage.



                  In the continuous case (density):



                  (continuous) probability distribution function = probability density function = density function
                  (continuous) probability distribution = density


                  In the discrete case (mass/distribution):



                  (discrete) probability distribution function = probability mass function
                  (discrete) probability distribution = distribution


                  Oddly enough, you may never see a probability mass function called a mass function or a distribution function, nor may you see a discrete probability distribution called a mass. I am sure there is some historical reason as to why. As they say, das war schon immer so und wird auch immer so bleiben.






                  share|cite|improve this answer









                  $endgroup$



                  Some abuse of language exists in these terms, which can vary. Below is a common usage.



                  In the continuous case (density):



                  (continuous) probability distribution function = probability density function = density function
                  (continuous) probability distribution = density


                  In the discrete case (mass/distribution):



                  (discrete) probability distribution function = probability mass function
                  (discrete) probability distribution = distribution


                  Oddly enough, you may never see a probability mass function called a mass function or a distribution function, nor may you see a discrete probability distribution called a mass. I am sure there is some historical reason as to why. As they say, das war schon immer so und wird auch immer so bleiben.







                  share|cite|improve this answer












                  share|cite|improve this answer



                  share|cite|improve this answer










                  answered Feb 21 at 4:38









                  Adam EricksonAdam Erickson

                  1114




                  1114















                      protected by Zev Chonoles Aug 23 '16 at 9:24



                      Thank you for your interest in this question.
                      Because it has attracted low-quality or spam answers that had to be removed, posting an answer now requires 10 reputation on this site (the association bonus does not count).



                      Would you like to answer one of these unanswered questions instead?



                      Popular posts from this blog

                      Lowndes Grove History Architecture References Navigation menu32°48′6″N 79°57′58″W / 32.80167°N 79.96611°W / 32.80167; -79.9661132°48′6″N 79°57′58″W / 32.80167°N 79.96611°W / 32.80167; -79.9661178002500"National Register Information System"Historic houses of South Carolina"Lowndes Grove""+32° 48' 6.00", −79° 57' 58.00""Lowndes Grove, Charleston County (260 St. Margaret St., Charleston)""Lowndes Grove"The Charleston ExpositionIt Happened in South Carolina"Lowndes Grove (House), Saint Margaret Street & Sixth Avenue, Charleston, Charleston County, SC(Photographs)"Plantations of the Carolina Low Countrye

                      random experiment with two different functions on unit interval Announcing the arrival of Valued Associate #679: Cesar Manara Planned maintenance scheduled April 23, 2019 at 00:00UTC (8:00pm US/Eastern)Random variable and probability space notionsRandom Walk with EdgesFinding functions where the increase over a random interval is Poisson distributedNumber of days until dayCan an observed event in fact be of zero probability?Unit random processmodels of coins and uniform distributionHow to get the number of successes given $n$ trials , probability $P$ and a random variable $X$Absorbing Markov chain in a computer. Is “almost every” turned into always convergence in computer executions?Stopped random walk is not uniformly integrable

                      How should I support this large drywall patch? Planned maintenance scheduled April 23, 2019 at 00:00UTC (8:00pm US/Eastern) Announcing the arrival of Valued Associate #679: Cesar Manara Unicorn Meta Zoo #1: Why another podcast?How do I cover large gaps in drywall?How do I keep drywall around a patch from crumbling?Can I glue a second layer of drywall?How to patch long strip on drywall?Large drywall patch: how to avoid bulging seams?Drywall Mesh Patch vs. Bulge? To remove or not to remove?How to fix this drywall job?Prep drywall before backsplashWhat's the best way to fix this horrible drywall patch job?Drywall patching using 3M Patch Plus Primer