rougier
diff --git a/‎01-preface.rst
Lines changed: 9 additions & 9 deletions b/‎01-preface.rst
Lines changed: 9 additions & 9 deletions
diff --git a/‎02-introduction.rst
Lines changed: 65 additions & 2 deletions b/‎02-introduction.rst
Lines changed: 65 additions & 2 deletions
diff --git a/‎03-anatomy.rst
Lines changed: 84 additions & 81 deletions b/‎03-anatomy.rst
Lines changed: 84 additions & 81 deletions
@@ -100,15 +100,15 @@ If you want to contribute to this book, you can:
 Publishing
 ++++++++++
 
-If you're an editor interested in publishing this book, you can contact me if
-you agree to have this version and all subsequent versions open access
-(i.e. online), you know how to deal with `restructured text
-<http://docutils.sourceforge.net/rst.html>`_ (Word is not an option), you
-provide a real added-value as well as supporting services, and more
-importantly, you have a truly amazing latex book template (and be warned that
-I'm a bit picky about typography & design: E.Tufte is my hero).
-
-Still here?
+If you're an editor interested in publishing this book, you can `contact me
+<mailto:[email protected]>`_ if you agree to have this version and all
+subsequent versions open access (i.e. online at `this address
+<http://www.labri.fr/perso/nrougier/from-python-to-numpy>`_), you know how to
+deal with `restructured text <http://docutils.sourceforge.net/rst.html>`_ (Word
+is not an option), you provide a real added-value as well as supporting
+services, and more importantly, you have a truly amazing latex book template
+(and be warned that I'm a bit picky about typography & design: `Edward Tufte
+<https://www.edwardtufte.com/tufte/>`_ is my hero). Still here?
 
 
 License
 
@@ -1,5 +1,68 @@
 Introduction
 ===============================================================================
 
-.. contents:: **Contents**
-   :local:
+Numpy is all about vectorization.
+
+If you are familiar with Python, this is the main difficulty you'll face
+because it requires for you to change your way of thinking and your new friends
+are named vectors, arrays, views or ufuncs.
+
+Let's take a very simple example: random walk. One possible object oriented
+approach would be to define a `RandomWalker` class and to write with a walk
+method that would return current position after each (random) steps. It's nice,
+but is is slow:
+
+**Object oriented approach**
+
+.. code:: python
+
+   class RandomWalker:
+      def __init__(self):
+          self.steps = []
+          self.position = 0
+
+      def walk(self, n):
+          yield self.position
+          for i in range(n):
+              step = 2*random.randint(0, 1) - 1
+              self.position += step
+              yield self.position
+           
+   walker = RandomWalker()
+   walk = []
+   for position in walker.walk(1000):
+       walk.append(position)
+
+
+       
+**Functional approach**
+
+For such a simple problem, we can probably save the class definition and
+concentrate only on the walk method that compute successive positions after
+each random steps.
+
+.. code:: python
+
+   def random_walk(n):
+       position = 0
+       walk = [position]
+       for i in range(n):
+           step = 2*random.randint(0, 1)-1
+           position += step
+           walk.append(position)
+       return walk
+
+   walk = random_walk(1000)
+
+**Vectorized approach**
+
+But, we can further simplifying things by considering a random walk to be
+composed of a number of steps and corresponding positions are the cumulative
+sum of these steps.
+
+.. code:: python
+       
+   steps = 2*np.random.randint(0, 2, size=n) - 1
+   walk = np.cumsum(steps)
+
+   
@@ -3,115 +3,118 @@ Anatomy of an array
 
 .. contents:: **Contents**
    :local:
+
+|WIP|
 
-Data type
----------
+..
+   Data type
+   ---------
+
+   Memory layout
+   -------------
+
+   View and copy
+   -------------
+
+   Let's consider two vectors `Z1` and `Z2`. We would like to know if `Z2` is a
+   view of `Z1` and if yes, what is this view ? Let's consider a simple example:
+
+   .. code-block::
 
-Memory layout
--------------
+      >>> Z1 = np.arange(10)
+      >>> Z2 = Z1[1:-1:2]
 
-View and copy
--------------
+   .. code-block::
+      :class: output
 
-Let's consider two vectors `Z1` and `Z2`. We would like to know if `Z2` is a
-view of `Z1` and if yes, what is this view ? Let's consider a simple example:
+         ╌╌╌┬───┬───┬───┬───┬───┬───┬───┬───┬───┬───┬╌╌
+      Z1    │ 0 │ 1 │ 2 │ 3 │ 4 │ 5 │ 6 │ 7 │ 8 │ 9 │
+         ╌╌╌┴───┴───┴───┴───┴───┴───┴───┴───┴───┴───┴╌╌
+         ╌╌╌╌╌╌╌┬───┬╌╌╌┬───┬╌╌╌┬───┬╌╌╌┬───┬╌╌╌╌╌╌╌╌╌╌
+      Z2        │ 1 │   │ 3 │   │ 5 │   │ 7 │
+         ╌╌╌╌╌╌╌┴───┴╌╌╌┴───┴╌╌╌┴───┴╌╌╌┴───┴╌╌╌╌╌╌╌╌╌╌
 
-.. code-block::
+   First test is to check whether `Z1` is the base of `Z2`
 
-   >>> Z1 = np.arange(10)
-   >>> Z2 = Z1[1:-1:2]
+   .. code-block::
 
-.. code-block::
-   :class: output
+      >>> print(Z2.base is Z1)
+      True
 
-      ╌╌╌┬───┬───┬───┬───┬───┬───┬───┬───┬───┬───┬╌╌
-   Z1    │ 0 │ 1 │ 2 │ 3 │ 4 │ 5 │ 6 │ 7 │ 8 │ 9 │
-      ╌╌╌┴───┴───┴───┴───┴───┴───┴───┴───┴───┴───┴╌╌
-      ╌╌╌╌╌╌╌┬───┬╌╌╌┬───┬╌╌╌┬───┬╌╌╌┬───┬╌╌╌╌╌╌╌╌╌╌
-   Z2        │ 1 │   │ 3 │   │ 5 │   │ 7 │
-      ╌╌╌╌╌╌╌┴───┴╌╌╌┴───┴╌╌╌┴───┴╌╌╌┴───┴╌╌╌╌╌╌╌╌╌╌
+   At this point, we know `Z2` is a view of `Z1`, meaning `Z2` can be expressed as
+   `Z1[start:stop:step]`. The difficulty now is to find `start`, `stop` and
+   `step`.  For the `step`, we can use the `strides` property of any array that
+   gives the number of bytes to go from one element to the other in each
+   dimension. In our case, and because both arrays are one-dimensional, we can
+   directly compare the first stride only:
 
-First test is to check whether `Z1` is the base of `Z2`
+   .. code-block::
 
-.. code-block::
+      >>> step = Z2.strides[0] // Z1.strides[0]
+      >>> print(step)
+      2
 
-   >>> print(Z2.base is Z1)
-   True
+   Next difficulty is to find the `start` and the `stop` indices. To do this, we
+   can take advantage of the `byte_bounds` method that returns a pointer to the
+   end-points of an array.
 
-At this point, we know `Z2` is a view of `Z1`, meaning `Z2` can be expressed as
-`Z1[start:stop:step]`. The difficulty now is to find `start`, `stop` and
-`step`.  For the `step`, we can use the `strides` property of any array that
-gives the number of bytes to go from one element to the other in each
-dimension. In our case, and because both arrays are one-dimensional, we can
-directly compare the first stride only:
+   .. code-block::
+      :class: output
 
-.. code-block::
+        byte_bounds(Z1)[0]                  byte_bounds(Z1)[-1]
+                ↓                                   ↓ 
+         ╌╌╌┬───┬───┬───┬───┬───┬───┬───┬───┬───┬───┬╌╌
+      Z1    │ 0 │ 1 │ 2 │ 3 │ 4 │ 5 │ 6 │ 7 │ 8 │ 9 │
+         ╌╌╌┴───┴───┴───┴───┴───┴───┴───┴───┴───┴───┴╌╌
 
-   >>> step = Z2.strides[0] // Z1.strides[0]
-   >>> print(step)
-   2
+            byte_bounds(Z2)[0]      byte_bounds(Z2)[-1]
+                    ↓                       ↓ 
+         ╌╌╌╌╌╌╌┬───┬╌╌╌┬───┬╌╌╌┬───┬╌╌╌┬───┬╌╌╌╌╌╌╌╌╌╌
+      Z2        │ 1 │   │ 3 │   │ 5 │   │ 7 │
+         ╌╌╌╌╌╌╌┴───┴╌╌╌┴───┴╌╌╌┴───┴╌╌╌┴───┴╌╌╌╌╌╌╌╌╌╌
 
-Next difficulty is to find the `start` and the `stop` indices. To do this, we
-can take advantage of the `byte_bounds` method that returns a pointer to the
-end-points of an array.
 
-.. code-block::
-   :class: output
+   .. code-block::
 
-     byte_bounds(Z1)[0]                  byte_bounds(Z1)[-1]
-             ↓                                   ↓ 
-      ╌╌╌┬───┬───┬───┬───┬───┬───┬───┬───┬───┬───┬╌╌
-   Z1    │ 0 │ 1 │ 2 │ 3 │ 4 │ 5 │ 6 │ 7 │ 8 │ 9 │
-      ╌╌╌┴───┴───┴───┴───┴───┴───┴───┴───┴───┴───┴╌╌
+      >>> offset_start = np.byte_bounds(Z2)[0] - np.byte_bounds(Z1)[0]
+      >>> print(offset_start) # bytes
+      8 
 
-         byte_bounds(Z2)[0]      byte_bounds(Z2)[-1]
-                 ↓                       ↓ 
-      ╌╌╌╌╌╌╌┬───┬╌╌╌┬───┬╌╌╌┬───┬╌╌╌┬───┬╌╌╌╌╌╌╌╌╌╌
-   Z2        │ 1 │   │ 3 │   │ 5 │   │ 7 │
-      ╌╌╌╌╌╌╌┴───┴╌╌╌┴───┴╌╌╌┴───┴╌╌╌┴───┴╌╌╌╌╌╌╌╌╌╌
+      >>> offset_stop = np.byte_bounds(Z2)[-1] - np.byte_bounds(Z1)[-1]
+      >>> print(offset_stop) # bytes
+      -16
 
+   Converting these offsets into indices is straightforward using the `itemsize`
+   and taking into account that the `offset_stop` is negative (end-bound of `Z2`
+   is logically smaller than end-bound of `Z1` array). We thus need to add the
+   items size of Z1 to get the right end index.
 
-.. code-block::
+   .. code-block::
 
-   >>> offset_start = np.byte_bounds(Z2)[0] - np.byte_bounds(Z1)[0]
-   >>> print(offset_start) # bytes
-   8 
-   
-   >>> offset_stop = np.byte_bounds(Z2)[-1] - np.byte_bounds(Z1)[-1]
-   >>> print(offset_stop) # bytes
-   -16
+      >>> start = offset_start // Z1.itemsize
+      >>> stop = Z1.size + offset_stop // Z1.itemsize
+      >>> print(start, stop, step)
+      1, 8, 2
 
-Converting these offsets into indices is straightforward using the `itemsize`
-and taking into account that the `offset_stop` is negative (end-bound of `Z2`
-is logically smaller than end-bound of `Z1` array). We thus need to add the
-items size of Z1 to get the right end index.
-   
-.. code-block::
+   Last we test our results:
 
-   >>> start = offset_start // Z1.itemsize
-   >>> stop = Z1.size + offset_stop // Z1.itemsize
-   >>> print(start, stop, step)
-   1, 8, 2
+   .. code-block::
 
-Last we test our results:
+      >>> print(np.allclose(Z1[start,stop,step], Z2))
+      True
 
-.. code-block::
 
-   >>> print(np.allclose(Z1[start,stop,step], Z2))
-   True
-   
+   Exercice
+   ++++++++
 
-Exercice
-++++++++
+   As an exercise, you can improve this first and very simple implementation by
+   taking into account:
 
-As an exercise, you can improve this first and very simple implementation by
-taking into account:
+   * Negative steps
+   * Multi-dimensional arrays
 
-* Negative steps
-* Multi-dimensional arrays
 
-  
-Sources
-+++++++
+   Sources
+   +++++++
 
-* `find_index <../code/find_index.py>`_ (solution to the exercise)
+   * `find_index.py <code/find_index.py>`_ (solution to the exercise)