Sunday 31 August 2014

Team-2 ( Types of Excel graphs)

There are many graphs or charts available in spreadsheet programs such as Excel and each has it own uses. They are as follows:

  • Bar graphs
  • Line graphs
  • Scatter plot graphs
  • pie charts
Bar graphs: Bar graphs also called column graphs are one of the most common types of graph used to display data. These are mostly used to show amounts or the number of times a value occurs. The amounts are displayed using a vertical bar or rectangle. The taller the bar, the greater number of times the value occurs.
http://spreadsheets.about.com/od/spreadsheetlessons/ss/excel_graph_use_2.htm

Line graphs: Line graphs are used to plot changes in data over time, such as monthly temperature changes or daily changes in stock market prices. They can also be used to plot data recorded from scientific experiments, such as how a chemical reacts to changing temperature or atmospheric pressure.
http://spreadsheets.about.com/od/spreadsheetlessons/ss/excel_graph_use_3.htm

Scatter plot graphs: Scatter plot graphs are used to show trends in data. They are especially useful when you have a large number of data points.
            When compared  line graphs connect the dots or points of data to show every change, whereas a scatter plot you draw a "best fit"line. The data points are scattered about the line. The closer the data points are to the line the stronger the correlation or affect one variable has on the other.
http://spreadsheets.about.com/od/spreadsheetlessons/ss/excel_graph_use_4.htm

Pie charts: Pie charts do not use horizontal and vertical axes to plot points like the others. They are used to chart only one variable at a time. As a result, it can only be used to show percentages.
The circle of pie charts represents 100%. The circle is subdivided into slices representing data values. The size of each slice shows what part of the 100% it represents.
http://spreadsheets.about.com/od/spreadsheetlessons/ss/excel_graph_use_5.htm

Bubble charts: Bubble charts are one way to show three dimensions of data in a flat 2D chart.  Bubble charts can facilitate the understanding of social, economical, medical, and other scientific relationships.
Bubble charts can be considered a variation of the scatter plot, in which the data points are replaced with bubbles.





Saturday 30 August 2014

Bubble Charts in excel and in R

A Bubble chart is a variation of a Scatter chart in which the data points are replaced with bubbles. A Bubble chart can be used instead of a Scatter chart if your data has three data series, each of which contains a set of values. In a Bubble chart, the size of the bubbles is determined by the values in the third data series. Bubble charts are often used to present financial data. We are not limited to the circular markers provided by default in bubble charts. We can use the technique described in custom chart series markers to use any shape you want for the markers. Motion charts are animated bubble charts. They track the performance of important things over time (in two dimensions). This is chart with 4 dimensions. But not all data with 4 dimensions can be plotted as a motion chart. One dimension has to be time, and another has to be linked to the importance of the item. we can also turn any bubble chart into a quadrant chart. This is useful if you want to categorize your data a little further by using a common layout like it is used in a SWOT or market attractiveness analysis. 
Making bubble charts in R:
Step 1:                    
Download R and load the data into R for which a bubble chart has to be created. A .csv file with data in it can be read into R using the command read.csv
Step 2:
Circles can be drawn using the command symbols() and the values for x-axis, y-axis and bubbles are given. By default,symbols() sizes the largest bubble to one inch, and then scales the rest accordingly. It can be changed by using the inches argument.
Step 3:
To change the color of the bubbles we can use the arguments fg and bg to change the foreground and background colors respectively.
Eg:
symbols(table$cloumnname1, table$columnname2, circles=radius, inches=0.35, fg="white", bg="red", xlab="  ", ylab="  ")
The xlab and ylab arguments are used to name the X-axis and Y-axis.

The shapes of the bubbles can be changed by changing the values in the “inches” argument. This changes the shape of the circle to square or any other shape.

Sources:
Team 3 :Bubble chart and Hans Rosling

bubble chart is not only used for representation of data with the additional information along with x-axis and y-axis with data of time period  this extended in large to form the visualisations famously knows as motion chart.hans rosling is the famous medical doctor, statistician he born in sweden but studied higher education india but majority of his work  done
is  in medical services in rural areas across the world .he was one of the cofounder of the gapminder foundation along with his son and daughter in law.Gapminder foundation received several awards for the visualize of the web development in the world he worked in famous organisations like WHO,UNICEF.

One Example for Bubble chart



source

Friday 29 August 2014

TEAM 4


Data Cleaning:
        Data cleaning is a process of detecting and correcting corrupted data or inaccurate data from a record set or tables or database. It is mainly used in databases and the term refers to identifying incomplete, incorrect, inaccurate, irrelevant data from parts of the data and then replacing or modifying or deleting the data.
         Data cleaning differs from data validation in that validation almost invariably means data is rejected from the system at entry and is performed at entry time, rather than on batches of data.The actual process of data cleansing may involve removing typographical errors or validating and correcting values against a known list of entities.Data cleansing may also involve activities like, harmonization of data, and standardization of data. For example, harmonization of short codes (St, rd etc.) to actual words (street, road). Standardization of data is a means of changing a reference data set to a new standard, ex, use of standard codes.
        There  are various types of charts how data can be visible clear in charts.
Bar Chart:
      A bar graph is a chart that uses either horizontal or vertical bars to show comparisons among categories. One axis of the chart shows the specific categories being compared, and the other axis represents a discrete value. Some bar graphs present bars clustered in groups of more than one (grouped bar graphs).Bar charts are usually scaled so that all the data can fit on the chart. Bars on the chart may be arranged in any order. Bar charts arranged from highest to lowest incidence are called Pareto charts. Normally, bars showing frequency will be arranged in chronological (time) sequence.
  
                         

Bubble chart:
bubble chart is a type of chart that displays three dimensions of data.Bubble charts can be considered a variation of the scatter plot, in which the data points are replaced with bubbles.This type of chart can be used instead of a Scatter chart if your data has three data series, each of which contains a set of values.
We have to choose the size of bubbles correctly.
                            

Thursday 28 August 2014

Team 3:Today’s class is about how to draw charts and graphs for a given data.


First of all charts and graphs are pictorial representation which makes the information clearer and easier to understand. News publishers use graphics all the time to show comparisons and explain important trends for things such as weather, gas prices, crime rate, or who is winning an election and by how much.

6 common reasons that we often have to make a chart:
1.         to Compare
2.         to show the Distribution
3.         to explain Parts of the Whole
4.         to tell the Trend over time
5.         to find out the Deviations
6.         to understand the Relationship

Types of charts that we use
Ø  To Compare :for comparison purpose we can use Bar Charts, Column Charts, Scatter Plots, Pie Charts, Line Charts, Data Tables
Ø  To Show the Distribution: To show distribution Column Charts, Scatter Plots, Line charts, Box Plots are used
Ø  To explain Parts of Whole: Column Charts, Bar Charts, Pie Charts, Data Table to show required parts from the whole data.
Ø  to tell the Trend over time :for clear representation of trend we can use Column Charts, Line Charts, Data Table
Ø  to find out the Deviations: to show deviation we can use Column Charts, Bar Charts, Line Charts, Data Table
Ø  to understand the Relationship: to show relationship between two or more variables we can use Scatter Plotline  Chart, Data Table

Other types of charts
Doughnut charts
Like a pie chart, a doughnut chart shows the relationship of parts to a whole; however, it can contain more than one data series. •This type of chart displays data in rings, where each ring represents a data series.

Radar chart
A radar chart compares the aggregate values of a number of data series. This chart displays changes in values relative to a center point. It can be displayed with markers for each data point.

bubble chart
A bubble chart is a type of xy (scatter) chart. It compares sets of three values and can be displayed with a 3-D visual effect. The size of the bubble, or data marker, indicates the value of a third variable. To arrange your data for a bubble chart, place the x values in one row or column, and enter corresponding y values and bubble sizes in the adjacent rows or columns. 

Multiple-Variable Graphs


Graphs are one of the easiest way of representing data. Generally we see bar graphs maximum for two independent variables, pie charts. But these will visualize the relation between two or more variables.

1.Scatter plots: These determine the relationship between two continuous variables and whether they are correlated or not.
2. Scatter plot matrix: It is that collection of  scatter plots into grid. It shows relation between pair of variables.
3.Overlay plot: It shows trends across time better than scatter plots do.
4.Side-by-Side box plot: It shows the relation between continuous variable and categorical variable.
5.Variability chart: This chart specifies multiple X variables and see the difference between means and variability across all the variables at once.
6.Graph Builder: We can change the variables and  graph at any point of time in graph builder.
7. Bubble chart: It represents its points as bubbles or circles. We can change the size and colour of the bubbles and animate them any point of time. 

- Group 5

Sunday 17 August 2014

Bubble Chart:



  • A Bubble Chart is a multi-variable graph that resembles a combination of a Scatter plot and a proportional area chart
  • Where X and Y axis are separate variables
  • However unlike a Scatter Plot, each point is assigned a label or category
  • Colours can also be used to distinguish between categories or used to represent an additional data variable.
  •  Time can be shown either by having it as variable on one of the axis or by animating the data variables changing over time.
  • Bubble Charts are typically used to compare and show therelationships between labeled/categorised circles, by the use of positioning and proportions. The overall picture of Bubble Charts can be use to analyse forpatterns/correlations.
This is the Bubble chart between the GDP and CO2 Per capita






http://www.datavizcatalogue.com/methods/bubble_chart.html#.U_DyacWSyd0

Saturday 16 August 2014

Team-3:Simple Techniques for presenting data

Getting data is simple but every analyst has to know how to represent the data in a understandable manner.According to Hans Rosling they are 6 techniques for presenting data.

  1. Explain the data axes: We have to represent the variables along X and Y axis by naming them clearly.
  2. Highlight subsets of data:The representation of graphs should not be complex,it should be as simple as possible,subsets should be differentiated.
  3. Dig deeper to unwrap data:We should represent the Unformatted data into simple graphs
  4. Place labels close to data points:Each and every data point must be represented for better understanding
  5. Answer the “Why?” questions:state the reasons for taking those two variable for representing the graph
  6. Complement data with energetic delivery

team 5

BUBBLE CHART:
A bubble chart is a good alternative to a bar chart that would need to include numerous bars. A bubble chart displays circles instead of bars, with each circle, or “bubble,” getting proportionally larger as the data size increases.

Disadvantage:  Bubble graphs are only visually useful when attempting to show relationships between multiple pieces of data.



source:  http://captus.samhsa.gov/

sample size: is needed for choosing the number of observations or replicates to include in a statistical sample. the sample size is important feature of any empirical study in which the goal is to make inferences about a population from a sample.
confidence interval: is a measure of the reliability of an estimate. it is a type of interval estimate of a population parameter. how frequently the observed interval contains the parameter is determined by the confidence level or confidence coefficient.








TEAM-----#1


Jugaad:

6 JUGAAD Principles

1) Seek opportunity from adversity
2)Do more with less
3)think and act flexibility
4)Keep it simple
5)Include the margin
6)Follow your heart



http://www.slideshare.net/giribiri/jugaad-innovation-34040028?qid=bae94262-b2c9-460e-a068-123342058f22&v=qf1&b=&from_search=3

Team # 2

Bubble Chart-Team # 2


A bubble chart is a variation of a scatter chart in which the data points are replaced with bubbles, and an additional dimension of the data is represented in the size of the bubbles. Just like a scatter chart, a bubble chart does not use a category axis — both horizontal and vertical axes are value axes. In addition to the x values and y values that are plotted in a scatter chart, a bubble chart plots x values, y values, and z (size) values.




Bubble charts plot data defined in terms of three distinct numeric parameters. They allow the comparison of entities in terms of their relative positions with respect to each numeric axis and their size as well.
Column charts and line charts have 2 axes commonly – a numeric axis and a categorical axis. The Y-Axis is the numeric axis for column and line charts. This means that the quantitative magnitude of the plot is indicated by the position of the plot with respect to the Y-axis. Bubble charts are different, because both axes of a bubble chart are numeric. Hence, the position of the plot is an indicator of two numeric values.  The area of the plot is an indicator of the magnitude of the third numeric characteristic.
Bubble chart showing investment in equities

Data Representation - Team # 4

Data representation using bubble chart

Bubble chart is similar to that of a scatter plot where the data points of a scatter plot are replaced by a bubble. We can use bubble chart if we have three data series with different values. The size of the bubble depends on one of the variable.
Bubble charts are generally used to represent financial data. We can use a bubble chart if we have data in the following format:
o   Three variables are needed for a data point i.e. X,Y and size of the bubble.
o   We can also represent negative data values in a bubble chart. They are represented using white bubbles and we cannot format those bubbles.

This link provides more details about how to create a bubble chart, format the bubble chart and an example for bubble chart.

Difference between a bar chart and histogram :
In a bar chart the data in the columns are positioned in a label that represents categorical data and the height of the column represents the size of the group defined.
In case of a histogram the columns are positioned over a label that represents a quantitative variable. The column variable can be single value or range of values.




Sai Durga
Team # 4





Friday 15 August 2014

Team 4


Jugaad:
Jugaad is a colloquial Hindi word that can mean an innovative fix or a simple work-around, sometimes pejoratively used for solutions that bend rules, or a resource that can be used as such, or a person who can solve a complicated issue.
Jugaad is an Indian phrase used to describe the kind of ingenuity that enables Indians to manage the large and small challenges of everyday life. The essence of Jugaad is to improvise, and to quickly and cheaply cobble a solution together from the materials you have at hand.
The Chotokool refrigerator
http://innovationmanagement.se/wp-content/uploads/2012/12/Chotokool.jpgDuring the development of Chotokool, a very cheap and compact refrigerator designed for families and small retailers in the country, Godrej’s designers went to visit potential customers in villages many times to understand their needs. Later in the development process, the designers returned with prototypes and collected more responses and ideas from end users. In total, the team spent 50 days with users in villages.
The result – called Chotokool – was a refrigerator with a 43liter volume. Cooling is done using so-called “Peltier” technology. It draws only between 40 and 60 watts.Chotokool can operate from batteries or solar cells. The box is made of plastic and the entire unit is assembled from a total of 20 parts. The Chotokool is opened from above, so the cold air stays down in the box. The price is Rs. 3,400 – approx. US$ 65.
This is the book that explains the importance of jugaad:

http://jugaadinnovation.com/wp-content/uploads/2012/02/Jugaad-Innovation-excerpt-final.pdf
Some images :




Analogy
An analogy is a comparison in which an idea or a thing is compared to another thing that is quite different from it. It aims at explaining that idea or thing by comparing it to something that is familiar. Metaphors and similes are tools used to draw an analogy. Therefore, analogy is more extensive and elaborate than either a simile or a metaphor. Consider the following example:
“Structure of an atom is like a solar system. Nucleus is the sun and electrons are the planets revolving around their sun.”
Here an atomic structure is compared to a solar system by using “like”. Therefore, it is a simile.
Metaphor :
A metaphor is a figure of speech that describes a subject by asserting that it is, on some point of comparison, the same as another otherwise unrelated object. It is a figure of speech comparing two unlike things without using either "like" or "as". It is not to be mistaken with a simile which does use "like" or "as" in comparisons. Metaphor is a type of analogy and is closely related to other rhetorical figures of speech that achieve their effects via association, comparison or resemblance includingallegory, hyperbole, and simile..

Levels of measurement:
Nominal The essential point about nominal scales is that they do not imply any ordering among the responses. For example, when classifying people according to their favorite color, there is no sense in which green is placed "ahead of" blue. Responses are merely categorized. Nominal scales embody the lowest level of measurement
Examples of Nominal Scales
Ordinal refers to quantities that have a natural ordering. The ranking of favorite sports, the order of people's place in a line, the order of runners finishing a race or more often the choice on a rating scale from 1 to 5.
Example of Ordinal Scales
Interval data is like ordinal except we can say the intervals between each value are equally split. The most common example is temperature in degrees Fahrenheit. The difference between 29 and 30 degrees is the same magnitude as the difference between 78 and 79 (although I know I prefer the latter). With attitudinal scales and the Likert questions you usually see on a survey, these are rarely interval, although many points on the scale likely are of equal intervals.
example of interval scale
Ratio data is interval data with a natural zero point. For example, time is ratio since 0 time is meaningful. Degrees Kelvin has a 0 point (absolute 0) and the steps in both these scales have the same degree of magnitude.
Example of Ratio Scales
This Device Provides Two Examples of Ratio Scales (height and weight)
Summary

Examples
Nominal
Ordinal
Interval
Ratio
frequency distribution.
Yes
Yes
Yes
Yes
median and percentiles.
No
Yes
Yes
Yes
add or subtract.
No
No
Yes
Yes
mean, standard deviation, standard error of the mean.
No
No
Yes
Yes
ratio, or coefficient of variation.
No
No
No
Yes