   __ __            __
  / // / ____ ____ / /_
 / // /_/ __ `/ _ \/ __/
/__  __/ /_/ /  __/ /_
  /_/  \__, /\___/\__/
       /____/
+ Welcome to the 4get API documentation +
+ Terms of use
Do NOT misuse the API. Misuse includes, but is not limited to ::
1. SERP scanning for SEO purposes
2. Intensive scraping
3. Any other activity that isn't triggered by a human
4. Illegal activities in Canada
5. Constant "test" queries while developing your program
(please cache the API responses!)
Examples of good uses of the API ::
1. A chatroom bot that presents users with search results
2. Personal use
3. Any other activity that is initiated by a human
If you wish to engage in the activities listed under "misuses", feel
free to download the source code of the project and run 4get
under your own terms. Please respect the terms of use listed here so
that this website may remain available to all in the far future.
P.S. fuck whoever botted my site for months on end, choke on my
dick lol!!!!
Get your instance running here ::
https://git.lolcat.ca/lolcat/4get
Thanks!
+ Passes
Depending on the instance, you may need to provide a "pass" token
in the cookies of your request. These can be obtained by solving
a captcha, which will allow you to make 100 requests over the next
24 hours. In the future, you will be able to ask the serber
maintainer for a "pass" which will allow you to bypass the captcha
requirement. The captcha doesn't need JavaScript to work.
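Something like this minimal PHP sketch should work; the cookie name
"pass" is assumed from the wording above, and the instance URL and
token are placeholders ::
$instance = "https://your-4get-instance"; // placeholder
$pass     = "xxxxxxxxxxxxxxxx";            // hypothetical captcha token

// Send the pass as a cookie (cookie name assumed to be "pass")
$context = stream_context_create([
    "http" => [
        "header" => "Cookie: pass=" . $pass
    ]
]);

$json = file_get_contents($instance . "/api/v1/web?s=higurashi",
    false, $context);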
+ Decode the data
All payloads returned by the API are encoded in the JSON format. If
you don't know how to tackle the problem, maybe programming is not
for you.
All of the endpoints use the GET method.
+ Check if an API call was successful
All API responses come with an array index named "status". If the
status is anything other than the string "ok", something went wrong.
You can supply the content of the "status" string back to your
application to inform the user of what went wrong.
The HTTP code will be 429 if your pass is invalid. It is set to 200
otherwise.
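Continuing the PHP sketch from the "Passes" section, a basic check
could look like this ::
$response = json_decode($json, true);

if(!is_array($response) || $response["status"] !== "ok"){
    // Surface the "status" string back to the user, as suggested above
    die("Search failed: " . ($response["status"] ?? "could not decode JSON"));
}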
+ Get the next page of results
All API responses come with an array index named "npt". To get the
next page of results, you must make another API call with &npt.
Example ::
+ First API call
/api/v1/web?s=higurashi
+ Second API call
/api/v1/web?npt=ddg1._rJ2hWmYSjpI2hsXWmYajJx < ... >
You shouldn't specify the search term; the &npt parameter alone
suffices.
The first part of the token before the dot (ddg1) refers to an
array position in the serber's memory. The second part is an
encryption key used to decode the data at that position. This way,
it is impossible to supply invalid pagination data, and it is
impossible for a 4get operator to peek at the private data of the
user after a request has been made.
The tokens expire as soon as they are used or after 15 minutes of
inactivity, whichever comes first.
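Continuing the same PHP sketch, fetching the next page only needs
the "npt" token from the previous response ::
if(!empty($response["npt"])){
    // Only &npt is needed; re-use the context so the pass cookie is sent
    $json = file_get_contents(
        $instance . "/api/v1/web?npt=" . urlencode($response["npt"]),
        false,
        $context
    );
}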
+ Beware of null values!
Most fields in the API responses can return "null". You don't need
to worry about unset values.
+ API Parameters
To construct a valid request, you can craft it through the 4get
web interface and replace "/web" with "/api/v1/web" in the
resulting URL.
+ "date" and "time" parameters
"date" always refer to a calendar date.
"time" always refer to the duration of some media.
They are both integers that uses seconds as its unit. The "date"
parameter specifies the number of seconds that passed since January
1st 1970.
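For example, turning both values into something readable in PHP;
the sample numbers are made up ::
$date = 1700000000; // seconds since January 1st 1970 (a Unix timestamp)
$time = 754;        // duration of some media, in seconds

echo gmdate("Y-m-d", $date) . "\n"; // calendar date: 2023-11-14
echo gmdate("i:s", $time) . "\n";   // duration as mm:ss: 12:34 (under an hour)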
    ______          __            _       __
   / ____/___  ____/ /___  ____  (_)___  / /______
  / __/ / __ \/ __  / __ \/ __ \/ / __ \/ __/ ___/
 / /___/ / / / /_/ / /_/ / /_/ / / / / / /_(__  )
/_____/_/ /_/\__,_/ .___/\____/_/_/ /_/\__/____/
                 /_/
+ /ami4get
Tells you basic information about the 4get instance. CORS requests
are allowed on this endpoint.
+ /api/v1/web
+ &extendedsearch
When using the ddg (DuckDuckGo) scraper, you may make use of the
&extendedsearch parameter. If you need rich answer data from
additional sources like StackOverflow, music lyrics sites, etc.,
you must set it to the string "true".
The default value is "false" for API calls.
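Example ::
/api/v1/web?s=higurashi&extendedsearch=true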
+ Parse the "spelling"
The array index named "spelling" contains 3 indexes ::
spelling:
    type: "including"
    using: "4chan"
    correction: '"4cha"'
The "type" may be any of these 3 values. When rendering the
autocorrect text inside your application, it should look like
what follows right after the parameter value ::
no_correction <Empty>
including Including results for %using%. Did you mean
%correction%?
not_many Not many results for %using%. Did you mean
%correction%?
As of right now, the "spelling" is only available on
"/api/v1/web".
+ Parse the "answer"
The array index named "answer" may contain a list of multiple
answers. The array index "description" contains a linear list of
nodes that can help you construct rich formatted data inside of
your application. The structure is similar to the one below:
answer:
    0:
        title: "Higurashi"
        description:
            0:
                type: "text"
                value: "Higurashi is a great show!"
            1:
                type: "quote"
                value: "Source: my ass"
Each "description" node contains an array index named "type".
Here is a list of them:
text
+ title
italic
+ quote
+ code
inline_code
link
+ image
+ audio
Each node type marked with a "+" above should be preceded by a
newline when constructing the rendered description.
Some nodes differ from the type-value format. Please parse them
accordingly (a small rendering sketch follows this list) ::
+ link
type: "link"
url: "https://lolcat.ca"
value: "Visit my website!"
+ image
type: "image"
url: "https://lolcat.ca/static/pixels.png"
+ audio
type: "audio"
url: "https://lolcat.ca/static/whatever.mp3"
The array index named "table" is an associative array. You can
loop over the data using this PHP code, for example ::
foreach($table as $website_name => $url){ // ...
The rest of the JSON is pretty self-explanatory.
+ /api/v1/images
All images are contained within "image". The structure looks like
the one below ::
image:
    0:
        title: "My awesome Higurashi image"
        source:
            0:
                url: "https://lolcat.ca/static/profile_pix.png"
                width: 400
                height: 400
            1:
                url: "https://lolcat.ca/static/pixels.png"
                width: 640
                height: 640
            2:
                url: "https://tse1.mm.bing.net/th?id=OIP.VBM3BQgeuf0-xScO1bl1UgHaGG"
                width: 194
                height: 160
The last image of the "source" array is always the thumbnail, and is
a good fallback to use when other sources fail to load. There can be
more than 1 source; this is especially true when using the Yandex
scraper, but beware of captcha rate limits.
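For example, a small PHP sketch that keeps the thumbnail (the last
"source" entry) as a fallback; which non-thumbnail source you
prefer is up to you ::
// $image = $response["image"][0]; taken from a decoded payload
$sources   = $image["source"];
$thumbnail = end($sources); // the last source is always the thumbnail
$preferred = $sources[0];   // any other source; here, simply the first one

echo "Try " . $preferred["url"] . ", fall back to " . $thumbnail["url"] . "\n";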
+ /api/v1/videos
The "time" parameter for videos may be set to "_LIVE". For live
streams, the amount of people currently watching is passed in
"views".
+ /api/v1/news
Just make a request to "/api/v1/news?s=elon+musk". The payload
has nothing special about it and is very self-explanatory, just
like the endpoint above.
+ /api/v1/music
Each entry under "song" contains an array index called "stream"
that looks like this ::
endpoint: audio_sc
url: https://api-v2.soundcloud <...>
When the endpoint is "audio_sc" and you want an audio stream back,
you MUST use 4get's audio_sc endpoint; otherwise, you are free to
handle the json+m3u8 crap yourself. If the endpoint is equal to
"audio", that URL SHOULD return a valid HTTP audio stream, and
using the "audio" endpoint becomes optional again.
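A hedged PHP sketch of that decision; the instance URL is a
placeholder, and passing the stream "url" as the "s" parameter of
/audio_sc is an assumption based on the /audio_sc section below ::
$instance = "https://your-4get-instance"; // placeholder
// $stream = $song["stream"]; taken from a decoded payload

if($stream["endpoint"] === "audio_sc"){
    // Let 4get turn the SoundCloud json+m3u8 mess into an audio stream
    $audio_url = $instance . "/audio_sc?s=" . urlencode($stream["url"]);
}else{
    // endpoint "audio": the URL should already be a valid HTTP audio stream
    $audio_url = $stream["url"];
}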
+ /favicon
Get the favicon for a website. The only parameter is "s", which
must include the protocol.
Example ::
/favicon?s=https://lolcat.ca
If 4get had to fall back to Google's favicon cache, it will report
an error in the X-Error header field. If Google's favicon cache
also failed to return an image, or if you're too retarded to specify
a valid domain name, a default placeholder image will be returned
alongside a "404" HTTP error code.
+ /proxy
Get a proxied image. Useful if you don't want to leak your user's IP
address. The parameters are "i" for the image link and "s" for the
size.
Acceptable "s" parameters:
portrait 90x160
landscape 160x90
square 90x90
thumb 236x180
cover 207x270
original <Original resolution>
You can also omit the "s" parameter if you wish to view the
original image. When an error occurs, an "X-Error" header field
is set.
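For example, a small PHP sketch that builds a proxied thumbnail URL;
the instance base URL is a placeholder ::
$instance  = "https://your-4get-instance"; // placeholder
$image_url = "https://lolcat.ca/static/pixels.png";

// "thumb" requests the 236x180 size listed above; omit &s for the original
$proxied = $instance . "/proxy?i=" . urlencode($image_url) . "&s=thumb";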
+ /audio
Get a proxied audio file. Does not support "Range" headers, as it's
only used to proxy small files.
The parameter is "s" for the audio link.
+ /audio_sc
Get a proxied audio file for SoundCloud. Does not support downloads
through wget or curl, since it returns 30kb~160kb "206 Partial
Content" parts, due to technical limitations that come with
converting m3u8 playlists to seekable audio files. If you use this
endpoint, you must support these 206 codes and also handle the
initial 302 HTTP redirect. I used this method as I didn't want to
store information about your request needlessly. This method also
allows noJS users to access the files.
The parameter is "s" for the SoundCloud JSON m3u8 abomination. It
does not support "normal" SoundCloud URLs at this time.
+ Appendix
If you have any questions or need clarifications, please send an
email my way to will at lolcat.ca