.. meta::
:description: Orange3 Textable Prototypes documentation, Gutenberg
widget
:keywords: Orange3, Textable, Prototypes, documentation, Gutenberg,
widget
.. _Gutenberg:
Gutenberg
=========
.. image:: figures/gutenberg.png
Retrieve texts from gutenberg.org
Author
------
Florian Rieder, Paul Zignani
Signals
-------
Input: None
Outputs:
* ``Text data``
A segmentation with the selected texts.
Description
-----------
The widget is designed to import one or more texts from ``__ in Orange Canvas.
The output is a segmentation containing a segment for each imported text, annoted by title author and language.
Interface
~~~~~~~~~
.. _Gutenberg_fig1:
.. figure:: figures/gutenberg_interface.PNG
:align: center
:scale: 75 %
:alt: Interface of the Gutenberg widget
Figure 1: **Gutenberg** widget interface.
The **Gutenberg** widget lets the user generate the cache of the
`Gutenberg `__ database, and search it.
The **Generate Cache** button allows the user to generate the gutenberg cache.
This has to be done only at first launch or if the database has been updated. It can
last about 5 to 10 minutes.
The **results** section allows the user to add or remove texts from
the search results.
The **corpus** section is the users "basket". He can add or remove texts from
his corpus in the **results** section.
The **Send** button triggers the emission of a segmentation to the output
connection(s). When selected, the **Send automatically** checkbox
disables the button and the widget attempts to automatically emit a
segmentation at every modification of its interface.
Caveat about searches
~~~~~~~~~~~~~~~~~~~~~
A search can be executed using one or more parameters. The only case it won't work
is if the *language* is set to `any` and the other parameters are empty. However it
is possible to make a search only by language.
The aurhors are written as `name, first name` in the database. However writing `first name name`
should also work for most of the authors.
In general using only one keyword in the inputs should give the most results.
Messages
--------
Information
~~~~~~~~~~~
* segments sent to output ( characters).*
This confirms that the widget has operated properly.
*The cache is being generated. This can take up to 10mn.*
This confirms that the cache is being generated. A message will appear
in the log once the cache is fully generated.
*The cache already exists.*
Informs the user that he has already downloaded the cache.
Warnings
~~~~~~~~
*Cache must be generated before fisrt launch, it can take up to 10mn*
Appears only if the cache dosen't exist. It hould appear only the first
time you create a gutenberg widget.
*Settings were changed, please click 'Send' when ready.*
Settings have changed but the **Send automatically** checkbox
has not been selected, so the user is prompted to click the **Send**
button (or equivalently check the box) in order for computation and data
emission to proceed.
*You didn't search anything*
The user wants to make a search without changing the initial settings.
*Your corpus is empty, please add some texts first*
The corpus is empty, so the user have to add some texts before sending.
Errors
~~~~~~
*An error occurred while building the cache.*
An error has prevented the cache generation.
*An error occurred while interrogating the cache.*
An error happend while searching the database.
*Couldn't download data from Gutenberg.*
An error has prevented the widget from accessing the
Gutenberg database or retrieving the data.