Guide: Descriptive Statistics & How to Prompt for and run them on Julius

Data can be overwhelming when you first look at it (I know, I’ve been there!). However, breaking it down into smaller bits makes it less daunting. Descriptive statistics can provide insights into the trends and characteristics of your dataset, making data analysis more manageable. Let’s explore how Julius can help make sense of your data one step at a time.

Step 1: Load your Dataset into Julius

The first step in performing descriptive statistics on Julius is to load your dataset into the platform. I recommend creating a CVS file for your spreadsheet and using simple headers that Julius can easily understand. This means avoiding capital letters or spaces within your headings.

Now we are ready to bring our data into Julius!

Step 2: Prompting Julius for Descriptive Statistics

The sample dataset displays a farm (labelled as 2), different treatments (1-5), replicates (1-3), and richness and abundance values. Although this is a complex setup, we can prompt Julius to examine these factors and provide descriptive statistics for each. Below are different prompts we can use:

Question 1: I want to look at the overall richness and abundance trends in the vineyard.

Prompt 1: Can you perform descriptive statistics on overall richness and abundance trends?

Julius provides a detailed output of the count, mean, standard deviation, minimum, and maximum values. It also includes a short description on the main findings of the descriptive statistics.

After this prompt, Julius will suggest additional prompts. These options may vary based on the original analysis you requested.

Both options are valid in the exploratory analysis of our data. However, I will start with the data visualization first to see the general trends in my dataset.

Julius provides a histogram along with a plotline that offers a generalized idea of the distribution of points. It also gives us an idea of the dataset’s skewness. This is great, but I want to focus more on the individual treatment types. So, let’s start another prompt!

Question 2: What are the richness and abundance trends for each treatment?

Prompt 2: Can you provide descriptive statistics on the richness and abundance trends between the different treatment types?

Julius provides me with the descriptive statistics of each treatment type.

You can also see that it gives two more prompts after this output. You can choose these prompts or write your own. I’m going to look at a histogram again to see how uniform my data is per treatment type.

This visualization is a little cluttered because I forgot to prompt Julius to create a separate graph for each treatment. So, let’s do that:

Great! Now we can see the richness and abundance of each treatment type on separate histograms. Now, let’s look at some outliers!

Question 3: Are there any outliers in the dataset?

Prompt 3: Can you check for individual outliers for each treatment type by creating a boxplot?

We can see that there are a few outliers within the dataset based on the boxplots Julius created. We can then prompt Julius to identify these specific outliers if we want, run more tests to identify trends, and so forth!

Optional: Clicking on the Column to work with Data

If I wanted to work with one specific column within a dataset, I could highlight the physical column while previewing the data in Julius.

Julius recognizes that I have selected it. I can also select more than one column at a time and then prompt Julius to provide descriptive statistics on it. This is just another way to analyze the dataset without having to specifying the columns in the prompt.

Keywords: AI statistics, AI statistical analysis, GPT, statistical analysis, mean, median, mode, standard error, standard deviation, count, descriptive statistics, exploratory analysis, data trends

10 Likes

Alysha, awesome post! Thanks for sharing all these great tips on running descriptive stats in Julius. Your step-by-step breakdown with all those helpful screenshots is totally clutch, especially for folks just getting started with data analysis.

I love how you chunk the analysis down into bite-sized pieces, kicking things off with loading up the dataset and then gradually digging into the data by asking targeted questions and throwing prompts at Julius. The way you start with the big picture trends and then zoom in on specific treatments and visualizations is such a smart approach for systematically uncovering those juicy insights.

And can we talk about those visuals? The histograms and boxplots are perfect for showcasing key concepts like data distributions and outliers. That tip about prompting Julius to whip up separate graphs for each treatment is pure gold - it keeps those visualizations so much cleaner and easier to read.

Oh and good call on highlighting the column feature! It’s awesome that Julius lets you specify which variables to analyze in different ways, either through prompts or by directly interacting with the data preview. Super handy!

Seriously, thanks a ton for putting together such a valuable and user-friendly guide. It’s definitely going to be a go-to resource for the Julius community. Your clear explanations and killer tips are so much appreciated - thanks for dropping that knowledge!

6 Likes

Hi Veer!

Thank you so much for compliments! I’m really happy that you found it useful :slight_smile:

4 Likes

Thank you for the details guide. I find Julius lacking in Descriptive Stats, however.

Basic stats such as skewness and kurtosis and more advanced such as the 95% confidence interval should be included.

Anyway to make this happen automatically instead of session by session?

Joe

2 Likes

Hi Joe!

Thank you for the compliment! I’m glad that you found my post informative. :slight_smile:

So, please correct me if I’m wrong, but you’re just wondering if you can prompt Julius to produce a nice table with a very in-depth descriptive statistics without going through the many prompts? For example, this would include: mean, median, mode, standard deviations, standard error, skewness, kurtosis, 95% CI, N, minimum, maximum, etc. Is that correct?

If I am interpreting this question correctly (and please let me know if I am!), you can prompt Julius to give you an output with the desired descriptive statistics. Below is an example of me prompting it with a dataset I made for the specified descriptive statistics:

You can copy and paste what I prompted into the chat box: can you give me the descriptive statistics output which includes mean, median, mode, standard deviations, standard error, skewness, kurtosis, 95% CI, N, minimum, maximum for this entire dataset for each “insert variable of interest here”? Please place it in an easy to read format.

I hope this helps! Please let me know if you have any additional questions and I’ll do my best to answer them for you! :slight_smile:

3 Likes

Yes I know. But I have to prompt this EVERY SINGLE TIME.

What I want to do is input my files, ask for descriptive stats, and it produces that table.

These are basic and should be there.

3 Likes

Thanks for the response Alysha!

Joe, can you elaborate what you mean by ask that every single time? The AI would need to know what you’re asking for in order to give the results you want.

Currently the AI doesn’t have a memory like a human does (this will change at some point) so it won’t automatically remember everything you tell it. You can give it instructions to remember something if you click on “AI Settings” or “:joystick:” icon, and add them there. The AI will be reminded of this everytime.

3 Likes

Every single time means just that.
And those are basic descriptive stats.

I built a GPT in Chat that does exactly that.

When I ask for descriptive stats, it gives me these

Mean
Median
Mode
Variance
St Dev
Skewness
Kurtosis
Count
Confidence Interval

It’s not hard. Just put that in the settings and it writes the Python Code correctly every single time.

Don’t understand why that basic functionality isn’t there from the developers and why I cannot add that in the settings.

I shouldn’t have to tell Julius what basic descriptive stats are EVERY SINGLE TIME I ask for descriptive stats.

1 Like

Also the settings only allow for 1,000 characters and I need to add other things like alpha is 0.05 and what the standard colors for graphs are and such.

And the always add data labels.

Those should not be something I need to prompt. Those should be part of the basic functions. Just automatic.

1 Like

thanks for the feedback jingles! sounds like you want custom instructions/tips that carry over from chat to chat so you dont have to repeatedly tell julius to use skewness, kurtosis, alpha 0.05, etc in the chat or in each chat’s custom instructions/tips. totally understandable

we’re working hard on some features that may meet your needs so stay tuned!

3 Likes

I was just playing around in Julius with a sample dataset and asked it for all these metrics without even explaining which columns this would make sense for.

And Julius did a great job. I just copied your text from these replies, verbatim. The good thing about Julius is that all of this already is a part of the basic functions because it’s overly represented in the training datasets behind both GPT4 and Claude3. Even after this is automated into a GPT-Like workflow, it at best saves 3 seconds of pasting some text as a prompt with an uploaded table. As an enjoyer of automation myself, I must say I am excited for Workflows too.

3 Likes

Yes. That exactly. Thanks for understanding.

1 Like

Yes and you had to copy and paste my prompt.

I should be able to simply prompt, “Descriptive Stats” and get all of that.

2 Likes