question archive CH110 ASSIGNMENT: To Excel in Chemistry Assignment Instructions: Hand in the fill-in-the-blanks sheets along with any supporting evidence such as required plots and table of values
Subject:ChemistryPrice:10.99 Bought8
CH110 ASSIGNMENT: To Excel in Chemistry
Assignment Instructions: Hand in the fill-in-the-blanks sheets along with any supporting evidence such as required plots and table of values. All work must be completed in pen (not pencil) or typed (preferred) directly in this document. No other lab report component is required for this assignment. Note that 15 marks out of 100 are assigned for including units and the correct number of significant digits so double check your work before handing in.
DUE DATE: Tuesday November 10th, 11:59PM to the online MyLS CH110 lab dropbox as ONE DOCUMENT/submission.
Google Sheets CANNOT be used to complete this assignment. Be sure to download the free version of Microsoft Office available to all WLU students.
To manipulate chemical data in a graphing program and to perform linear regression and data analysis. To learn techniques to write proper computer generated scientific lab reports.
Course Learning Outcome
? Demonstrate competency in linear graphical data analysis
Introduction |
In chemistry, is it often useful to determine the mathematical relationship between two physical parameters. Graphs are useful tools that can help discover and elucidate such relationships. First, plotting a graph provides a visual image of data and any trends therein. Second, analysis of the graph can provide a mathematical expression that describes the relationship between the two physical parameters. Third, graphs can be used to estimate the values of the physical parameters that are beyond the range of the data set and that are between observed data points; the former process is called extrapolation while the latter is called interpolation.
One common relationship between physical parameters is a linear relationship. A linear relationship occurs when two or more quantities are proportional to one another. When a linear relationship exists between physical parameters and those parameters are graphed, a distinct pattern is observed—the plotted points resemble points on a straight line. A straight line can be fitted to the plotted points. Historically, fitting a line to plotted points was done by hand and with a straight edge but today modern software is used for this purpose. Software uses a statistical technique known as linear regression to create a straight line that is the best fit between all the data points on a plot. In addition, software often provides a mathematical expression of the line, usually in the familiar slope-intercept form: y = mx + b. This equation expresses the mathematical relationship between the two variables plotted, and allows extrapolation and interpolation. Figure 1 shows the general form for a straight line graph.
Slope-intercept form of best-fit line:
y = mx + b
Δy
y-intercept: b
Δx Slope: m = Δy/Δx = y2 − y1/x2 − x1
y |
|
a |
x |
i |
s |
best-fit line |
b |
x- axis
Figure 1: The general form of a graph displaying graph components.
Computer spreadsheets are powerful tools for manipulating and graphing quantitative data. In this exercise, the spreadsheet program Microsoft Excel? will be used for this purpose. In particular, students will learn to use Excel in order to explore a number of linear graphical relationships.
Apple Computer Users: These instructions were written to apply to Microsoft Excel PC. There may be some difficulty in applying these instructions if an Apple computer system is used. However, the procedures for Excel for Mac should be similar to those provided here. There are two options: Complete the assignment using your Apple computer or use a school computer that has Microsoft Excel installed.
Part 1: Simple Linear Plot
Scenario: A gas laws experiment is designed to measure the volume of 1 mole of helium gas at a variety of different temperatures, while keeping the gas pressure constant at 758 Torr. The data that was collected is given below in table 1.
Table 1: Temperature and Volume of Helium data collected in a gas laws experiment.
Temperature (K) |
Volume of Helium (L) |
203 |
14.3 |
243 |
17.2 |
283 |
23.1 |
323 |
25.9 |
363 |
31.5 |
Figure 2: Highlighted data set of Temperature and Volume of Helium
Click on the “Insert” tab at the top left, followed by “Scatter” button (Figure 3).
Figure 3 |
: |
Highlighting the Insert Tab which displays the Scatter button |
. |
Choose the scatter graph that shows data points only. This is the option labeled “Scatter with Only Markers” (Figure 4).
Figure 4: Scatter with Only Marks button
You should now see a scatter plot on your Excel screen, which provides a preview of your graph (Figure 5).
Figure 5: Preview of Excel graph with data from table 1.
If all looks well, it is time to add titles and label the axes of your graph.
To change the default title to the title you have chosen, click inside the text box to place the cursor in the box then delete the default title and add your title.
location of the Chart Title and Axis Title buttons on the Layout tab. |
|
Figure 6 |
: |
Chart Title and Axis Title buttons needed to create chart and axis titles. |
Now click on the “Trendline” button which is located in the “Layout” tab under “Chart Tools”. Choose “More Trendline Options”. The “Format Trendline” window should now appear (Figure 7).
The radio button for the “Linear” trendline, under “Trend/Regression Type” should be selected, it is usually the default trendline option, but if it is not, select it.
Now select the “Display Equation on Chart” box and the “Display R-squared value on
Chart” box by clicking in the check-box to the left of these options. Then click the
“Close” button. |
Figure 7 |
: |
Format trendline window displayed in Excel spreadsheet. |
Figure 8 |
: |
Axis options window open to change minimum x-axis value. |
Method (1): Graphical Extrapolation.
Graphical extrapolation uses the graph and the best-fit-line to estimate the coordinates of a point that is beyond the range of the data set. This is accomplished by extending the best-fit-line and measuring on the graph the coordinates of the desired point.
Figure 9 |
: |
|
Displays the backward forecast box in the format trendline window. |
Method (2): Extrapolation via Equation of the line.
This method is largely a mathematical method, only the equation of the best-fit line is needed.
Part 2: Using Functions |
Scenario: A second gas laws experiment is designed to measure how quickly helium gas diffuses from a source to a detector ten meters away. The temperature of the sourced is varied and the time of diffusion is measured. The data collected from the experiment are given in Table 2.
Table 2: Temperature and Time of Diffusion Data for Helium Gas.
Temperature (K) |
Time of Diffusion (ms) |
2.00 × 102 |
9.72 |
3.00 × 102 |
7.93 |
4.00 × 102 |
6.87 |
5.00 × 102 |
6.15 |
6.00 × 102 |
5.61 |
7.00 × 102 |
5.19 |
8.00 × 102 |
4.86 |
Create a plot of time versus temperature. Follow the procedure as outlined in Part 1. Include the R2 value. Answer these questions in your lab report: Why is time plotted on the y-axis? Which set of data is placed on the left in the Excel worksheet?
diffusion. You will convert each temperature value in Table 2 into
You will instruct Excel to fit a trendline and calculate the R2 value for each of our plots. You will compare all four graphs and use the R2 values to determine which plot gives the most linear relationship between the two variables.
When you release the mouse button, all the cells will fill with the natural logarithm of each corresponding temperature value. This is a quick and easy method to copy the same function into many cells but at the same time changing the reference cell. The reference cell is the cell the formula uses for its input information in its calculation. The reference cell for cell C2 is cell A2. Notice how Excel copies the same formula from cell C2 to cell C3 but it changes the reference cell from A2 to A3. This is called a relative copy and continues down the column if you use the fill handle method. Don’t forget to check a few of the values in column C to ensure that Excel is making the correct calculations.
Figure 10: An example of a fill handle on a selected cell.
Figure 11: Location of the decrease decimal icon in the Home tab.
To remedy this, we can copy the time of diffusion values from column B and paste them into column D. To do this, highlight B2 to B8 by click and dragging the mouse, then select “Copy” and then select cell D2. Select “Paste”. This will paste the time of diffusion data into column D. Create a natural logarithm of the temperature versus the time of diffusion with a best-fit-line and R2 value.
Another method uses the power function. The formula to be entered is “=POWER(A2,0.5)”, where POWER() is the formula, A2 is the reference cell and 0.5 is the power to which the reference is raised. Another form of the power formula is “=A2^0.5”, where the circumflex character ( ^) means “raised to the power of”.
For the formula to calculate the inverse of the square root of the temperature there are a number of formulas: “=1/SQRT(A2)”, “=POWER(A2,-0.5)”, “=POWER(A2,−0.5)”, =( A2)^(−0.5). Once you have all the columns finished, complete the plots, fit a trendline and R2 values to the plots and determine which is the most linear.
e) Include all graphs along with the table of values that was used to create all four graphs .
Include this in your assignment as part of one document
For reference: To print just the data and not the graphs, highlight all the data including titles and select “File” then “Print”. From the drop down menu under settings that currently says “Print Active Sheets”, click on the down arrow and select “Print Selection”. Then only the highlighted data should be printed.
Part 3: Two Data Sets with Overlay
Scenario: In a Beer’s Law experiment, a spectrophotometer is used to measure the amount of light absorbed for two different coloured nanoprisms. The two sets of data collected are
presented in the table below. |
Table 3: Concentration and Absorbance values for two Nanoprisms A and B. |
Nanoprism A |
Nanoprism B |
Concentration A |
( |
mol/L |
) |
Absorbance |
( |
unitless |
) |
Concentration B |
mol/L |
) |
( |
Absorbance |
unitless |
) |
( |
0.100 |
0.049 |
0.800 |
0.125 |
0.200 |
0.168 |
0.850 |
0.285 |
0.300 |
0.261 |
0.900 |
0.440 |
0.400 |
0.360 |
0.950 |
0.620 |
0.500 |
0.470 |
0.600 |
0.590 |
0.700 |
0.700 |
0.750 |
0.750 |
You would like to see how these two sets of data relate to each other. To do this you will have to place both sets of data, as independent relationships, on the same graph. Superimposing plots is a viable option for presenting multiple sets of graphical data if the values of the dependent variables of the two sets of data are similar and if the values of the dependent variables of the two sets of data are similar.
Figure 12: Select Data Source window displaying for Sheet 3.
Figure 13 |
: |
Edit Series window displayed for Nanoprism B Series. |
Series |
|
Y |
|
values |
|
collapse |
|
dialogue |
button |
Series |
|
X |
|
values |
|
collapse |
|
dialogue |
button |
Scenario: The following data was collected from an experiment which measures the rate constant (k) of a first order hydrolysis reaction as a function of temperature.
Table 4: Temperature and Rate constant data for a first order hydrolysis reaction.
Temperature (K) |
Rate Constant, k (s−1) |
2.80×102 |
4.70 × 10−2 |
2.85×102 |
6.87 × 10−2 |
2.90×102 |
9.85 × 10−2 |
2.95×102 |
1.41 × 10−1 |
3.00×102 |
1.97 × 10−1 |
a) The rate constant, k, and the temperature, T, are related via equation (1):
−Ea
|
k=AeRT The parameter A is called the frequency factor, the units of which are identical to those of k (s−1 in this case), Ea is the energy of activation (units of kJ/mol), and R is the thermodynamic gas constant (8.31 x 10-3 kJ/K mol). Since the relationship between k and T is an exponential one, the data set, when plotted, will not be linear. Your goal is to mathematically transform Equation 1 from an exponential equation into a linear equation. |
(1) |
b) |
How do you take an exponential function and transform it into a linear function? Well, you have to transform the exponential term into a linear term. To do this, simply take the natural logarithm of both sides of the equation and simplify. This gives a new, equation (2) below. E lnk ln A a |
|
= − RT It is this function that you must plot to get a straight line. But exactly what are you going to plot on the x axis and y axis to create this linear graph? If you rearrange the previous equation, it might become evident (Equation 3). Equation 3 is in the slope-intercept form of a straight line. Identify y, x, m and b in Equation 3 and record them on your report. E lnk a 1 ln A |
(2) |
=− ( )+
R T (3)
the correct number of significant digits.
? the value of Ea in kJ/mol. ? the value of A, in s-1.
Record and show your work with correct final number of significant digits for the calculations on the report. Include the table of the new calculated data and the actual plot in your assignment.
Part 5: Statistical Analysis
Scenario: A sample of water was analysed for sulfate ion (SO4-2) concentration. Ten separate analyses were conducted. The results of the analyses are presented in Table 5.
Table 5: Sulfate ion concentration in ppm for water samples in lab #1 at WLU.
Analysis # |
1 |
2 |
3 |
4 |
5 |
6 |
7 |
8 |
9 |
10 |
WLU Lab#1 (ppm) |
35.9 |
43.2 |
33. 5 |
35.1 |
32.8 |
37.6 |
31.9 |
36. 6 |
35.0 |
32.0 |
Simple statistical analyses of this data set include determinations of the mean and median concentration, and the standard deviation of the data set. Both the mean and median are measures of central tendency of the data, commonly referred to as the average. Standard deviation is a measure of the amount of deviation or variation in the data set. Standard deviation is a measure of precision. A small standard deviation indicates a data set that tends to group closely together, while a large standard deviation indicates a data set that is spread out over a large range.
The mean or arithmetic mean ( ¯x ) is defined as the sum (?) of each of the measurements (xi) in a data set divided by the number of measurements (N) (equation 4):
∑x i
¯x=
N The median (M) is the midpoint value of a numerically ordered data set, where half of the measurements are above the median and half are below. The median location of N measurements (equation 5) can be found using: |
(4) |
M=(N+1)/2 When N is an odd number, the formula yields an integer that represents the value corresponding to the median location in an ordered distribution of measurements. For example, in the set of numbers (3, 1, 5, 4, 9, 9, 8) the median location is (7 + 1) / 2, or the 4th value. When applied to the numerically ordered set (1, 3, 4, 5, 8, 9, 9), the number 5 is the 4th value and is thus the median – three scores are above 5 and three are below 5. Note that if there were only 6 numbers in the set (1, 3, 4, 5, 8, 9), the median location is (6 + 1) / 2, or the 3.5th value. In this case the median is half-way between the 3rd and 4th values in the ordered distribution, or 4.5. Standard deviation (s) is a measure of the variation in a data set, and is defined as the square root of the sum of squares divided by the number of measurements minus one (equation 6): |
(5) |
s
(6)
To find s, subtract each measurement from the mean, square that result, add it to the results of each other difference squared, divide that sum by the number of measurements minus one, then take the square root of this result. The larger the standard deviation, the greater the variation in the data and the lower the precision in the measurements.
We use significant arithmetic (significant figures rules) as a way to maintain significant digits when determining standard deviation. The standard deviation and the average should both end at the same decimal place. For example, if we had determined that the average mark on a class test, marked to unit precision, was 72.36598 with a standard deviation of 7.22014, we would express the class result as 72 (7), where 72 is the average, the standard deviation is reported in parenthesis, and both are reported to the same number of decimals.
While the mean, median and standard deviation can be calculated with a calculator, it is often more convenient to use a computer to determine these values. Microsoft Excel is particularly well suited for such statistical analyses, especially on large data sets.
To compute the mean:
To compute the median:
To compute the standard deviation:
Rejecting Outliers
Do all the measurements in the WLU #1 data set look equally good to you, or are there any points that do not seem to fit with the others? If so, is it appropriate to reject these measurements?
Outliers are data points which lie far outside the range defined by the rest of the measurements and may skew your results to a great extent. If you determine that an outlier resulted from an obvious experimental error, for example you incorrectly read an instrument or incorrectly prepared a solution, you may reject the point without hesitation. If, however, none of these errors are evident, you must use caution in making your decision to keep or reject a point. One rough criterion for rejecting a data point is if it lies beyond two standard deviations from the mean.
f) Did the standard deviation change between the original and re-calculated data? If so, what does this represent for your recalculated data? Answer on the report sheet.
Rejecting data points may not be done just because you want your data to look better. If you choose to reject an outlier for any reason, you must always clearly document in your lab report or on your data sheet:
Failure to disclose this could constitute scientific fraud.
Adapted from: Scholefield, Michelle. “Using Excel for Graphical Analysis of Data”.
Chemistry 11 Lab Experiments. Santa Monica College. CA. USA, 2007.
Name: Date Submitted: ID#:_________________________________ Lab Section:
ASSIGNMENT 1 – To Excel in Chemistry
Instructions :
Fill in the blanks using a PEN OR TYPED answers and submit all sheets and required graphs/tables only in ONE document to be uploaded to your online CH110 Lab MyLS dropbox. No other lab report criteria is required.
Part 1: Simple Linear Plot |
? |
Which set of data is plotted on the y-axis? |
|
|
the x-axis? |
|
? |
Record the following information: |
|
|
The equation of the fitted trendline |
|
The value of the slope of this line
The value of the y-intercept of this line
Show your calculations for b) below:
Part 2: Using Functions |
|
Time vs. Temperature |
Time vs. ln(Temp) |
Time vs. (Temp)0.5 |
Time vs. (Temp)−0.5 |
R2 |
|
|
|
|
The equation of the trendline for the most linear plot
The value of the slope of this line
Part 3: Two Data Sets and Overlay
Nanoprism A: Nanoprism B:
Part 4: Choosing Correct Parameters for Graphing
E a 1
lnk=− ( )+ln A
R T
If this equation is in the slope-intercept form of a line, which term in this equation corresponds to:
y? x? m?
b?
Equation: _______
Part 5: Statistical Analysis |
the mean SO42- concentration the median SO42- concentration the standard deviation in the data set
If yes, which measurement(s) are the outliers?
Show the calculations you used to identify the outlier(s). If you determined that there were no outliers, explain how you came to this conclusion.
the mean SO4-2 concentration the median SO4-2 concentration the standard deviation in the data set
Purchased 8 times