Begiak

From Apertium
Revision as of 04:13, 8 December 2016 by Jatinluthra14 (talk | contribs)
Jump to navigation Jump to search

begiak is the irc bot in the #apertium irc channel. Its primary function is to report svn updates to various relevant repositories. However, it serves some other purposes, including to showcase some applications of the apertium translation API.

Background

Begiak was set up by conor_f for GCI 2012, and several modules were written and fixed by sushain, Qasim, and a number of other GCI students since then. It runs on apertium.vm.bytemark.co.uk. The code base is the mutantmonkey port to python3 of the phenny bot. Our codebase is currently hosted by Jonathan in his github.

Begiak means "the eyes" in Basque.

Repositories monitored

The following svn repositories are monitored by begiak. Begiak polls for updates when it receives a ping from the server. Also see below for some ways to manually update

AI Features

Begiak recognises the following commands.

Apertium features

These are special apertium-related or #apertium-specific commands, or were designed with apertium in mind.

  • .awik — Search for something on Apertium wiki or point a user to a Apertium wiki page.
  • .awikstats – "awikstats update LANG" should update the wikipage apertium-LANG/stats
  • .(eth|lg|ethno|logue|ethnologue) lg — Search Ethnologue.com for information on a desired language (given ISO 639 code or language name)
  • .fight nick — have begiak fight someone for you.
  • .follow nick lg1-lg2 (follower) — Follow someone and translate as they speak. (Translations are sent as notices to the follower (admin only!))
  • .following — List people currently being followed.
  • .gitserver [status|start|stop] — get the githook-server's status/start it/stop it. See Begiak's git plugin for more info.
  • .hug nick — have begiak hug someone for you.
  • .iso639 lg — look up ISO 639-1, -2 and -3 language codes either by code or by language name. Codes sourced from Ethnologue.com (639-3) and Wikipedia (-1 and -2)
  • .isodb update — update the local database of ISO 639 codes (admin only!)
  • .isodb status — see if the database updater (.isodb update) is running
  • .listpairs (lg) — List language pairs available for apertium translation
  • .listlangs — List languages available for apertium translation
  • .(ml|mailinglist) poll — Poll for new messages in apertium mailing lists (this is automatically done every five minutes).
  • .(ml|mailinglist) last list — Get the last message in a specified list.
  • .noexceptions (nick) — Tells someone there aren't ever any exceptions
  • .t ... lg1-lg2 ( | lg2-lg3 ...) — Apertium translate
  • .unfollow nick — Stop following a user.
  • .wikicount lg — Get the number of articles an official or incubated Wikipedia for the specified language has, given a Wikipedia code or partial language name.
  • begiak: esan! — Poll for svn updates ("esan!" means "say!" in Basque)
  • begiak: recent — Get most recent svn/git updates (Begiak's git plugin describes the git part)
  • begiak: info <repo> <rev> — Get information on a given revision of a given SVN or git repository (Begiak's git plugin describes the git part)

General features

These are more general-use commands; while many are useful to us in #apertium, they weren't designed specifically for #apertium (though some were hacked to make them more appropriate for #apertium).

  • .8ball — magic 8 ball
  • .away (status) — Sets your status to away
  • .back (status) — Sets your status to being available.
  • .beats — Shows the internet time in Swatch beats
  • .botsnack — Give begiak a snack for being good
  • .c — Query DuckDuckGo calculator
  • .catfacts — Random facts about cats
  • .chill — Measure the current channel chillness level.
  • .choose choice1 choice2 — begiak will choose one of the two options (randomly).
  • .ety — Look up the etymology of a word
  • .fcc callsign — Look up a callsign issued by the FCC
  • .g — Search on Google
  • .gc — Get the number of results on Google
  • .greeting — Manage Greetings
    • .greeting add name message — Add a greeting
    • .greeting del name - Delete a greeting
  • .in — Set a reminder
  • .imdb — Search imdb
  • .linx url — Upload url to linx.li
  • .npl — Shows the time from NPL's SNTP server
  • .nsfw link - for when a link isn't safe for work
  • .py — Evaluate an expression in python (This runs as a sandboxed web service, not on your computer)
  • .pesters
    • begiak: pester nick message — Sends a message to nick every 12 hours they're around (or something. settable in config)
    • .pesters snooze nick - 'snooze' a pester from nick
    • .pesters dismiss nick - to stop pestering nick
  • Point commands:
    • The point commands are used to point other users to the output of several supported commands which are as follows:
      • Apertium wiki( .awik )
      • Wikipedia( .wik )
      • Urban Dictionary( .urb )
      • Wikitionary( .w )
      • Clock( .time and .tz )
      • firespeaker quotes( .randquote )
      • Google Search ( .g )
    • Replace .wik or .awik with any of the above commands for appropriate output like 'svineet, "India, officially the Republic of India (Bhārat Gaṇarājya),[18][19][c] is a country in South Asia" - https://en.wikipedia.org/wiki/India' for .wik India -> svineet
      • .awik Begiak -> svineet
      • .awik Begiak → svineet
      • nick: .wik India
      • nick, .wik India
      • .awik point nick India
  • .queue — Manage lists.
    • .queue new name [item1, item2 ...] — Make a new queue
    • .queue name — Display a queue
    • .queue delete name — Delete a queue
    • .queue name add item — Add an item
    • .queue name remove item — Remove an item
    • .queue name pop — Pop an item off the queue
    • .queue name swap item1, item2 — Swap two items
  • .seen — When was the last time a user was seen?
  • .slogan word — Sloganize a word
  • .stache url — Moustachify an image
  • .time (TimeZone) — Returns current time in TimeZone (can be a city or a time zone e.g UTC). If TimeZone is empty, returns current time in GMT time zone.
  • .tz — A timezone translation plugin. Converts a time from one timezone to another. (e.g., .tz 335EST in CET, or .tz 335EST in London)
  • .tzdb — Manage database
    • .tzdb update — Updates a timezones database
    • .tzdb status — Returns a status of updating a timezones database
  • .title — Get the title of a link
  • .tells — See a summary of outstanding reminders you've sent to others using "begiak: tell"
    • begiak: tell nick message (**pm**) — Tells someone something the next time they say something in a channel begiak is monitoring. The **pm** option has the message sent privately.
    • .tells nick — List your reminders currently queued to that nick
    • .tells rm num — Delete a queued reminder using its number from .tells nick
  • .tock — Shows the time from the USNO's atomic clock
  • .u — Search for a unicode character
  • .urb — Query urban dictionary
  • .val url — Check a webpage using the W3C Markup Validator.
  • .w — look up a word on wiktionary
  • .wa — Query Wolfram Alpha
  • .weather — Get the weather for a city
  • .whereis nick — Tells you nick's current status.
  • .wik — Search for something on Wikipedia or point another user to a Wikipedia page.
  • .wuvt — Find out what radio station WUVT is currently playing
  • .yi — Shows whether it is currently yi or not
  • begiak: "parlez-vous français"? — Translate a phrase (google translate)
  • begiak: reload module — reload a module (admin only!)

??

These functions/plugins either don't seem to work or need some figuring out:

  • .tfw
  • rule34 disabled
  • lastfm
  • mylife (.fml .mlia)
  • oblique

Wishlist

  • Apertium and APY
  • awikstats:
    • count .rlx sizes ☭
    • count number of transfer rules ☭
    • ignore post-* (used to work?)
    • use each file's rev for pairs with their own monodices, e.g. Apertium-en-es/stats
  • URL interceptor
  • mediawiki plugins
    • have awik plugin search the apertium wiki and return top hit if a page isn't found (like wikipedia plugin) ☭
    • have a function that tells someone else the results of .awik or .wik, e.g. for mentors to point students to resources ☭
    • have better logic for text that's displayed for a page/section (need examples where it fails!) ☭
      • where there's immediately a subsection? ☭
      • where the first thing is a table or infobox? ☭
      • where the first . doesn't end the sentence? ☭
      • where the first thing is [[…|En français]] (though really, we should have a real template on the wiki for translations)
    • generalise mediawiki plugin?
  • find out why the "posted" command isn't working as expected
  • document any command that does not have information ☭
  • Pollers (svn, git, etc.)
    • show short links to new apertium-stuff posts (poll like with commits?) ☭
    • allow channel-specific filtering of commit (etc.?) poller reporting ☭
      • alternatively: begiak lacks a filter or two, sometimes fills up channels making S/N high. begiak should detect quiet channels and not output there.
    • svnpoller and gitpoller's lookup functions (i.e., repeat information on specific revision)
  • timezones
      • better mapping of city names to timezones

Help!

Begiak is controlled by an init script (/etc/init.d/begiak.sh) on bytemark, and can be started, stopped, and restarted by anyone with sudo access.