Section Navigation
Section Overview
This section bridges the gap between average rates of change and instantaneous gradients using the concept of limits. You will master the Power Law of differentiation to find equations for tangents and normals, identify stationary points, and solve complex Paper 2 optimization problems involving real-world constraints.
Why This Matters for AIHL Exams
- The “Marginal” Connection: In AI HL, “Marginal Cost” or “Marginal Profit” are code words for the derivative ($$C'(x)$$). Expect these in business modeling contexts.
- Optimization Multi-Tool: Finding the maximum volume or minimum surface area using constraints is a high-yield 10–15 mark question pattern in Paper 2.
- Tangents & Normals: You will frequently be asked to find the intersection of normals or the equations of tangents as part of larger geometry-calculus crossover questions.
High-Score Focus (Level 6–7 Insight)
- Negative/Fractional Index Trap: Most errors occur before differentiation. Always rewrite expressions like $$\displaystyle\frac{4}{\sqrt{x}}$$ as $$4x^{-1/2}$$ before applying the power law to avoid basic algebraic slips.
- Normal vs. Tangent Gradient: Remember that the normal is perpendicular. A common Level 5 error is using the tangent gradient $$m$$ for the normal equation instead of $$-1/m$$.
- Constraint Substitution: In optimization, you cannot differentiate a function with two variables (e.g., $$r$$ and $$h$$). You must use the constraint equation to eliminate one variable first.
By the End of This Section, You Should Be Able To:
Suggested Study Path
|
Study MetricsCore: 45–60 mins Practice: 90–120 mins Exam Priority: ★★★★★ |
Â
Â
Limit Concepts and Derivative Definitions
Â
Â
Gradients of Curves and Tangents
Â
The gradient of a curve at a specific point is defined as the gradient of the tangent line that touches the curve at that point.
Â
Â
Key Idea: The Limit of a Chord
Â
To find the exact gradient of a tangent at point \( P(x, y) \), we consider a second point \( Q \) on the curve a small distance \( h \) away. The line connecting \( P \) and \( Q \) is called a chord.
Â
As the distance \( h \) approaches zero (\( h \to 0 \)), point \( Q \) moves closer to \( P \), and the gradient of the chord approaches the gradient of the tangent at \( P \).
Â
Â
For the function \( y = x^2 \), the gradient of the chord between \( (x, x^2) \) and \( (x+h, (x+h)^2) \) is:
Â
$$ \text{Gradient} = \frac{(x+h)^2 – x^2}{h} = \frac{x^2 + 2xh + h^2 – x^2}{h} = \frac{2xh + h^2}{h} = 2x + h $$
Â
As \( h \to 0 \), the term \( h \) vanishes, leaving the gradient of the tangent as \( 2x \).
Â
Definition: The Derivative
Â
The function that gives the gradient of the graph \( y = f(x) \) at the point \( x \) is written as \( f'(x) \) and is defined by the limit:
Â
$$ f'(x) = \lim_{h \to 0} \left( \frac{f(x+h) – f(x)}{h} \right) $$
Â
This gradient function, \( f'(x) \), is referred to as the derivative of \( x \).
Â
Â
Exam Tip
The notation \( \lim_{h \to 0} \) is read as “the limit as \( h \) tends to zero”.You will not be asked questions requiring deep use of this notation, but understanding that derivatives come from limits is essential for the course.
Â
Â
Rules for Differentiation
Â
Using the limit definition on various power functions allows us to discover a general pattern known as the Power Law.
Â
This allows us to find the derivative without calculating limits from first principles every time.
Â
Formula: The Power Law
Â
For any real number \( n \):
Â
- If \( f(x) = ax^n \), then \( f'(x) = anx^{n-1} \)
- If \( f(x) = c \) (a constant), then \( f'(x) = 0 \)
- If \( f(x) = mx \), then \( f'(x) = m \)
Â
Additionally, if \( h(x) = f(x) + g(x) \), then \( h'(x) = f'(x) + g'(x) \).
Â
Â
Worked Example
Â
Use the power law to find the derivative of each of the following functions:
Â
a. \( f(x) = 3x^2 + 2x + 7 \)
b. \( f(x) = \frac{3}{x^2} \)
c. \( f(x) = 4\sqrt{x} + \frac{1}{\sqrt{x}} \)
Â
Solution
Â
a. Applying the rules term by term
Â
Differentiate \( 3x^2 \): multiply by the power 2 and subtract 1 from the power. \( 3(2)x^{2-1} = 6x \).
Â
Differentiate \( 2x \): the gradient of a linear term \( mx \) is just \( m \), so \( 2 \).
Â
Differentiate \( 7 \): the derivative of a constant is \( 0 \).
Â
$$ f'(x) = 6x + 2 $$
Â
b. Using negative indices
Â
First, rewrite the function using a negative exponent.
Â
$$ f(x) = 3x^{-2} $$
Â
Apply the power law: Multiply by -2 and subtract 1 from the exponent.
Â
$$ f'(x) = 3(-2)x^{-2-1} = -6x^{-3} $$
$$ = -\frac{6}{x^3} $$
Â
c. Using fractional indices
Â
Rewrite the function using powers.
Â
$$ f(x) = 4x^{\frac{1}{2}} + x^{-\frac{1}{2}} $$
Â
Apply the power law to each term:
Â
$$ f'(x) = 4\left(\frac{1}{2}\right)x^{\frac{1}{2}-1} + \left(-\frac{1}{2}\right)x^{-\frac{1}{2}-1} $$
$$ = 2x^{-\frac{1}{2}} – \frac{1}{2}x^{-\frac{3}{2}} $$
$$ = \frac{2}{\sqrt{x}} – \frac{1}{2\sqrt{x^3}} $$
Â
Â
Gradients at Specific Points
Â
Once we have the derivative function \( f'(x) \), we can substitute specific values of \( x \) to find the gradient of the tangent at that point.
Â
Â
Worked Example
Â
For each of the functions below:
Â
i. Find \( f'(x) \)
ii. Find the gradient of the curve at the point where \( x = 2 \)
iii. Write down the set of values of \( x \) for which the function is increasing.
Â
a. \( f(x) = 2x^2 + 3x – 5 \)
b. \( f(x) = \displaystyle\frac{2}{x} + x, \quad x \neq 0 \)
Â
Solution
Â
Part a
Â
i. Differentiating term by term:
Â
$$ f'(x) = 4x + 3 $$
Â
ii. Substitute \( x = 2 \) into the derivative:
Â
$$ f'(2) = 4(2) + 3 = 11 $$
Â
iii. The function is increasing when its gradient is positive, i.e., \( f'(x) > 0 \).
Â
$$ 4x + 3 > 0 \Rightarrow 4x > -3 \Rightarrow x > -0.75 $$
Â
Â
Part b
Â
i. Rewrite as \( f(x) = 2x^{-1} + x \). Differentiate:
Â
$$ f'(x) = -2x^{-2} + 1 = -\frac{2}{x^2} + 1 $$
Â
ii. Substitute \( x = 2 \):
Â
$$ f'(2) = -\frac{2}{2^2} + 1 = -0.5 + 1 = 0.5 $$
Â
iii. Increasing when \( f'(x) > 0 \):
Â
$$ -\frac{2}{x^2} + 1 > 0 \Rightarrow 1 > \frac{2}{x^2} \Rightarrow x^2 > 2 $$
Â
This inequality holds when \( x > \sqrt{2} \) or \( x < -\sqrt{2} \).
Â
Values:
Â
\( x > 1.41 \) and \( x < -1.41 \).
Â
Â
Reflect
What is the relationship between the gradient of a curve and the sign of its derivative?If \( f'(x) \) is positive, the curve goes up (increases).If \( f'(x) \) is negative, the curve goes down (decreases).
Â
Â
Applications: Marginal Cost
Â
In economics, the “marginal” rate often refers to the derivative. Specifically, marginal cost is the rate at which cost changes as production increases. It approximates the cost of producing one more unit.
Â
Â
Worked Example
Â
A company produces motorbike helmets and the daily cost function can be modelled as:
Â
$$ C(x) = 600 + 7x – 0.0001x^3 \quad \text{for } 0 \le x \le 150 $$
Â
where \( x \) is the number of helmets produced and \( C \) is the cost in US dollars.
Â
a. Write down the daily cost to the company if no helmets are produced.
b. Find an expression for the marginal cost, \( C'(x) \).
c. Find the marginal cost if (i) 20 helmets are produced, (ii) 80 helmets are produced.
d. State the units of the marginal cost.
Â
Solution
Â
a. Cost at zero production
Â
Substitute \( x = 0 \) into \( C(x) \):
Â
$$ C(0) = 600 + 0 – 0 = \$600 $$
Â
This represents the fixed costs.
Â
b. Marginal Cost Expression
Â
Differentiate \( C(x) \) using the power law:
Â
$$ C'(x) = 7 – 3(0.0001)x^2 $$
$$ C'(x) = 7 – 0.0003x^2 $$
Â
c. Calculating values
Â
i. Substitute \( x = 20 \):
Â
$$ C'(20) = 7 – 0.0003(20)^2 = 7 – 0.0003(400) = 7 – 0.12 = 6.88 $$
Â
ii. Substitute \( x = 80 \):
Â
$$ C'(80) = 7 – 0.0003(80)^2 = 7 – 0.0003(6400) = 7 – 1.92 = 5.08 $$
Â
d. Units
Â
Since the derivative represents the change in Cost divided by the change in quantity (helmets), the unit is:
Â
$ per helmet
Â
(This is the extra cost for each new helmet made).
Â
Â
TOK
Â
The notation \( \lim_{h \to 0} \) is read as “the limit of… as \( h \) tends to zero”.
Â
Historically, the concept of a quantity becoming infinitely small without actually being zero caused great philosophical debate (e.g., Bishop Berkeley’s “ghosts of departed quantities”).
Â
How do we justify mathematical rigour when dealing with concepts that lie on the edge of the physical world’s logic?
Â
Alternative Notation
Â
While the function notation \( f'(x) \) is commonly used, there is an alternative notation for the derivative that explicitly shows the variables involved. If we have a relationship \( y = f(x) \), the derivative is often written as:
Â
$$ \frac{dy}{dx} $$
Â
This notation, introduced by Leibniz, reminds us that the gradient is derived from the “difference in \( y \)” divided by the “difference in \( x \)” (\( \Delta y / \Delta x \)) as the differences become infinitely small.
Â
TOK: Notation and Discovery
Â
Calculus was independently discovered by Isaac Newton (who used dot notation, \( \dot{y} \)) and Gottfried Wilhelm Leibniz (who used \( dy/dx \)).
Â
Leibniz’s notation is widely considered more intuitive for algebraic manipulation. Does the symbol we use to represent a concept influence our ability to understand it?
Exam Tip
There is no strict rule on which notation to use, but \( \frac{dy}{dx} \) is often preferred when dealing with equations relating two variables (e.g., \( y = \dots \)), while \( f'(x) \) is used for functions.
Â
Â
Â
Worked Example
Â
The tangent to the curve \( y = 2x^2 + 3x – 4 \) at the point \( A \) has a gradient of 11. Find the coordinates of \( A \).
Â
Solution
Â
First, we find the derivative function to determine the gradient formula.
Â
$$ \frac{dy}{dx} = 4x + 3 $$
Â
We are told that the gradient at point \( A \) is 11. Therefore, we equate the derivative to 11.
Â
$$ 4x + 3 = 11 $$
$$ 4x = 8 \rightarrow x = 2 $$
Â
Now, substitute \( x = 2 \) into the original equation to find the corresponding \( y \)-coordinate.
Â
$$ y = 2(2)^2 + 3(2) – 4 $$
$$ y = 8 + 6 – 4 = 10 $$
Â
So, the coordinates of \( A \) are \( (2, 10) \).
Â
Â
Equations of Tangents and Normals
Â
A tangent is a straight line that touches the curve at a point and has the same gradient as the curve at that point.
Â
A normal is a straight line that passes through the point of contact and is perpendicular to the tangent.
Â
Formula: Perpendicular Lines
Â
If two lines are perpendicular, the product of their gradients is \( -1 \).
Â
If the gradient of the tangent is \( m \), then the gradient of the normal is \( -\displaystyle\frac{1}{m} \).
Â
Â
Â
Worked Example
Â
Find the equation of the tangent to the curve \( y = 2x^2 + 4\sqrt{x} \) at the point where \( x = 4 \).
Â
Solution
Â
Step 1: Find the derivative.
Â
Rewrite the term \( 4\sqrt{x} \) as \( 4x^{\frac{1}{2}} \) to apply the power rule.
Â
$$ y = 2x^2 + 4x^{\frac{1}{2}} $$
Â
$$ \frac{dy}{dx} = 4x + 4(\frac{1}{2})x^{-\frac{1}{2}} = 4x + \frac{2}{\sqrt{x}} $$
Â
Step 2: Calculate the gradient at \( x = 4 \).
Â
$$ \text{Gradient } m = 4(4) + \frac{2}{\sqrt{4}} = 16 + \frac{2}{2} = 17 $$
Â
Step 3: Find the \( y \)-coordinate at \( x = 4 \).
Â
$$ y = 2(4)^2 + 4\sqrt{4} = 2(16) + 4(2) = 32 + 8 = 40 $$
Â
The point is \( (4, 40) \).
Â
Step 4: Write the equation of the line.
Â
Using the point-slope form \( y – y_1 = m(x – x_1) \):
Â
$$ y – 40 = 17(x – 4) $$
$$ y = 17x – 68 + 40 $$
$$ y = 17x – 28 $$
Â
Â
Worked Example
Â
Find the equation of the normal to the curve defined by \( f(x) = 2x^3 + 3x – 2 \) at the point where \( x = 1 \).
Â
Solution
Â
Step 1: Find the point of contact.
Â
$$ f(1) = 2(1)^3 + 3(1) – 2 = 2 + 3 – 2 = 3 $$
Â
So the point is \( (1, 3) \).
Â
Step 2: Find the gradient of the tangent.
Â
Differentiate the function:
Â
$$ f'(x) = 6x^2 + 3 $$
Â
Substitute \( x = 1 \):
Â
$$ f'(1) = 6(1)^2 + 3 = 9 $$
Â
The gradient of the tangent is 9.
Â
Step 3: Find the gradient of the normal.
Â
Since the normal is perpendicular to the tangent:
Â
$$ m_{\text{normal}} = -\frac{1}{m_{\text{tangent}}} = -\frac{1}{9} $$
Â
Step 4: Write the equation.
Â
$$ y – 3 = -\frac{1}{9}(x – 1) $$
Â
Multiplying by 9:
Â
$$ 9y – 27 = -(x – 1) $$
$$ 9y – 27 = -x + 1 $$
$$ x + 9y – 28 = 0 $$
Â
Â
Worked Example
Â
The gradient of the normal to the graph of the function defined by \( f(x) = kx^3 – 2x + 1 \) at the point \( (1, b) \) is \( -\frac{1}{4} \).
Â
Find the values of \( k \) and \( b \).
Â
Solution
Â
Step 1: Use the normal gradient to find the tangent gradient.
Â
If the normal gradient is \( -\frac{1}{4} \), the tangent gradient must be the negative reciprocal:
Â
$$ m_{\text{tangent}} = 4 $$
Â
Step 2: Find \( k \) using the derivative.
Â
$$ f'(x) = 3kx^2 – 2 $$
Â
At \( x = 1 \), the derivative equals the tangent gradient:
Â
$$ f'(1) = 3k(1)^2 – 2 = 4 $$
$$ 3k – 2 = 4 $$
$$ 3k = 6 \rightarrow k = 2 $$
Â
Step 3: Find \( b \) using the original function.
Â
Substitute \( k = 2 \) and \( x = 1 \) into \( f(x) \):
Â
$$ f(1) = 2(1)^3 – 2(1) + 1 $$
$$ b = 2 – 2 + 1 $$
$$ b = 1 $$
Â
Â
Local Maximum and Minimum Points
Â
The derivative allows us to identify turning points on a curve. At a smooth peak (maximum) or a smooth valley (minimum), the tangent is horizontal.
Â
Key Idea: Stationary Points
Â
A local maximum or minimum point occurs where the gradient of the curve is zero.
Â
$$ f'(x) = 0 $$
Â
- Local Maximum: The gradient changes from positive (increasing) to negative (decreasing).
- Local Minimum: The gradient changes from negative (decreasing) to positive (increasing).
Â
Â
Worked Example
Â
Consider the derivative function \( \frac{dy}{dx} = 1 – \frac{1}{x^2} \) for \( x \neq 0 \).
Â
a. Find the values of \( x \) at which \( \frac{dy}{dx} = 0 \).
b. State whether these points represent a local maximum or minimum on the curve for \( y \).
Â
Solution
Â
a. Finding stationary points
Â
Set the derivative to zero:
Â
$$ 1 – \frac{1}{x^2} = 0 $$
$$ 1 = \frac{1}{x^2} $$
$$ x^2 = 1 \rightarrow x = \pm 1 $$
Â
So, the stationary points are at \( x = 1 \) and \( x = -1 \).
Â
b. Determining the nature of the points
Â
We examine the sign of the gradient around the points.
Â
For \( x = 1 \):
Test \( x = 0.5 \): \( \frac{dy}{dx} = 1 – \frac{1}{0.25} = 1 – 4 = -3 \) (Negative)
Test \( x = 2 \): \( \frac{dy}{dx} = 1 – \frac{1}{4} = 0.75 \) (Positive)
Â
Since the gradient goes Negative \(\to\) Zero \(\to\) Positive, \( x = 1 \) corresponds to a Local Minimum.
Â
For \( x = -1 \):
Test \( x = -2 \): \( \frac{dy}{dx} = 1 – \frac{1}{4} = 0.75 \) (Positive)
Test \( x = -0.5 \): \( \frac{dy}{dx} = 1 – 4 = -3 \) (Negative)
Â
Since the gradient goes Positive \(\to\) Zero \(\to\) Negative, \( x = -1 \) corresponds to a Local Maximum.
Â
Â
Conceptual Prompt
We found that \( f'(x) = 0 \) at maximum and minimum points. Are there points where \( f'(x) = 0 \) but the point is neither a maximum nor a minimum? (Think about the graph of \( y = x^3 \) at \( x=0 \)).
Â
Â
In many practical situations, such as manufacturing or economics, we wish to find the “best” possible solution—whether that is maximizing profit, minimizing cost, or minimizing waste material.
Â
In calculus, this process is known as optimization.
Â
Â
Solving Problems with Constraints
Â
Key Idea: The Method of Constraints
Â
If a function to be optimized has more than one variable (e.g., \( r \) and \( h \)), you cannot differentiate it immediately.
Â
You must be given a fixed condition, called a constraint (e.g., Volume = 500).
Â
1. Write the constraint as an equation.
2. Rearrange the constraint to make one variable the subject (e.g., \( h = \dots \)).
3. Substitute this expression into the function you want to optimize.
4. The function now has only one variable and can be differentiated.
Â
TOK: Notation and History
Â
The French mathematician Joseph Lagrange invented the prime notation \( f'(x) \) to denote a derivative.
Â
This alternative to Leibniz’s \( \frac{dy}{dx} \) is compact and highlights the functional relationship.
Â
How does the choice of notation in mathematics shape our thinking about the concepts they represent?
Â
Â
Worked Example
Â
A can of dog food contains \( 500 \text{ cm}^3 \) of food. The manufacturer, wanting to make sure that the company received maximum profits, would like to make sure that the surface area of the can was as small as possible.
Â
Let the radius of the can be \( r \) cm and the height, \( h \) cm.
Â
a. Find an expression for the surface area \( S \) in terms of \( r \).
b. Find \( \frac{dS}{dr} \).
c. Hence, find the dimensions of the can that will have the minimum surface area.
Â
Solution
Â
a. Finding the expression for Surface Area
Â
The surface area of a cylinder is given by the sum of the curved surface area and the two circular ends:
Â
$$ S = 2\pi rh + 2\pi r^2 $$
Â
However, this equation has two variables, \( r \) and \( h \). We need to eliminate \( h \) using the volume constraint.
Â
We know the volume is \( 500 \text{ cm}^3 \):
Â
$$ V = \pi r^2 h = 500 $$
Â
Rearranging to solve for \( h \):
Â
$$ h = \frac{500}{\pi r^2} $$
Â
Now, substitute this expression for \( h \) into the surface area equation:
Â
$$ S = 2\pi r \left( \frac{500}{\pi r^2} \right) + 2\pi r^2 $$
$$ S = \frac{1000}{r} + 2\pi r^2 $$
Â
b. Finding the derivative
Â
First, rewrite the term \( \frac{1000}{r} \) using a negative exponent so we can apply the power rule:
Â
$$ S = 1000r^{-1} + 2\pi r^2 $$
Â
Now, differentiate with respect to \( r \):
Â
$$ \frac{dS}{dr} = -1000r^{-2} + 4\pi r $$
Â
c. Finding the minimum dimensions
Â
A local maximum or minimum occurs when the gradient is zero.
Â
$$ \frac{dS}{dr} = 0 $$
$$ -1000r^{-2} + 4\pi r = 0 $$
$$ \frac{-1000}{r^2} + 4\pi r = 0 $$
Â
Multiply by \( r^2 \) (assuming \( r \neq 0 \)) or rearrange:
Â
$$ 4\pi r = \frac{1000}{r^2} $$
$$ 4\pi r^3 = 1000 $$
$$ r^3 = \frac{1000}{4\pi} = \frac{250}{\pi} $$
$$ r = \sqrt[3]{\frac{250}{\pi}} \approx 4.30 \text{ cm} $$
Â
Now we find the corresponding height \( h \) using the equation from part a:
Â
$$ h = \frac{500}{\pi (4.30)^2} \approx 8.60 \text{ cm} $$
Â
So, the best dimensions for the can are \( r \approx 4.3 \text{ cm} \) and \( h \approx 8.6 \text{ cm} \).
Â
Â
Exam Tip
Â
The question asks “Hence”, which implies you must use your previous result.To justify that this solution is indeed a minimum, you could show that the gradient changes from negative to positive across the point \( r = 4.3 \), or use the second derivative test (if relevant to your syllabus level).
Â
Conceptual Prompt
Â
Notice in the solution above that the optimal height \( h \approx 8.6 \) is exactly double the optimal radius \( r \approx 4.3 \). In other words, \( h = 2r \), or height equals diameter.Why might this “square profile” (when viewed from the side) be the most efficient shape for minimizing surface area for a cylinder?
Â
🎯 Examiner’s Radar: Calculus & Optimization Logic
📋 Paper Mapping
- Paper 2 (Optimization): This is a “Big Points” topic. Examiners check for the substitution of the constraint into the objective function. Missing this setup step usually results in losing the entire 10-mark block.
- Paper 3 (Modeling): You may be asked to interpret $$f'(x)$$ in context. If $$f(x)$$ is volume, $$f'(x)$$ is the rate of change of volume. Use specific units like $$\text{cm}^3/\text{s}$$ to secure the interpretation mark.
💡 Scoring Secrets
- “Show That” traps: If a question says “Show that $$S = \displaystyle\frac{1000}{r} + 2\pi r^2$$”, do not differentiate yet. You must show the algebraic substitution of $$h$$ from the volume constraint.
- The Perpendicular Rule: When finding the Normal, always state $$m_1 m_2 = -1$$ or $$m_{\text{norm}} = -1/m_{\text{tan}}$$. Explicitly writing this formula often grants a “Method Mark” even if your calculation slips.






