hp300/DOC/TODO.hp300

1. Create and use an interrupt stack.
   Well actually, use the master SP for kernel stacks instead of
   the interrupt SP.  Right now we use the interrupt stack for
   everything.  This allows for more accurate accounting of systime.
   In theory, it could also allow for smaller kernel stacks but we
   only use one page anyway.

2. Copy/clear primitives could be tuned.
   What is best is highly CPU and cache dependent.  One thing to look
   at is the copyin/copyout primitives.  Rather than looping using
   MOVS instructions, you could map an entire page at a time and use
   bcopy, MOVE16, or whatever.  This would lose big on the VAC models
   however.

3. Sendsig/sigreturn are pretty bogus.
   Currently we can call a signal handler even if an exception
   occurs in the middle of an instruction.  This causes the handler
   to return right back to the middle of the offending instruction
   which will most likely lead to another exception/signal.
   Technically, I feel this is the correct behavior but it requires
   saving a lot of state on the user's stack, state that we don't
   really want the user messing with.  Other 68k implementations
   (e.g. Sun) will delay signals or abort execution of the current
   instruction to reduce saved state.  Even if we stick with the
   current philosophy, the code could be cleaned up.

4. Ditto for AST and software interrupt emulation.
   Both are possibly over-elaborate and inefficiently implemented.
   We could possibly handle them by using an appropriately planted
   PS trace bit.

5. Make use of transparent translation registers on 030/040 MMU.
   With a little rearranging of the KVA space we could use one to
   map the entire external IO space [ 600000 - 20000000 ).  Since
   the translation must be 1-1, this would limit the kernel to 6MB
   (some would say that is hardly a limit) or divide it into two
   pieces.  Another promising use would be to map physical memory
   within the kernel.  This allows a much simpler and more efficient
   implementation of /dev/mem, pmap_zero_page, pmap_copy_page and
   possibly even kernel-user cross address space copies.  However,
   it does eat up a significant piece of kernel address space.
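   As a rough illustration of the matching rule (as I understand the
   030/040: only the top 8 address bits are compared, and mask bits
   set to 1 are ignored), here is a small C model.  tt_match() and
   its arguments are made-up names for this sketch, not kernel code.

```c
#include <stdint.h>

/*
 * Model of a transparent translation match (assumed behavior):
 * compare address bits 31-24 against "base", ignoring every bit
 * position where "mask" has a 1.  Returns nonzero on a match.
 */
static int tt_match(uint32_t addr, uint8_t base, uint8_t mask)
{
    uint8_t top = (uint8_t)(addr >> 24);

    return ((top ^ base) & (uint8_t)~mask) == 0;
}
```

   In this model the granularity is 16MB, so one register cannot
   begin exactly at 600000.  With base 0x00 and mask 0x1f the match
   covers [ 0 - 20000000 ): tt_match(0x00600000, 0x00, 0x1f) is true,
   but so is tt_match(0x00000000, 0x00, 0x1f), while
   tt_match(0x20000000, 0x00, 0x1f) is false.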

6. Create a 32-bit timer.
   Timers 2 and 3 on the MC6840 clock chip can be concatenated together to
   get a 32-bit countdown timer.  There are at least three uses for this:
   1. Monitoring the interval timer ("clock") to detect lost "ticks".
      (Idea from Scott Marovich)
   2. Implement the DELAY macro properly instead of approximating with
      the current "while (--count);" loop.  Because of caches, the current
      method is potentially way off.
   3. Export as a user-mappable timer for high-precision (4us) timing.
   Note that by doing this we can no longer use timer 3 as a separate
   statistics/profiling timer.  We should be able to select between the
   two at compile time (runtime?).
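   A minimal sketch of use 2, assuming a 32-bit countdown that ticks
   every 4us and is read as two 16-bit halves.  Everything here is
   hypothetical: struct sim_timer stands in for the hardware (each
   simulated access lets one tick go by), and real MC6840 register
   access is more involved than this.

```c
#include <stdint.h>

#define TICK_US 4u   /* timer resolution quoted above */

/* Simulated hardware: a 32-bit countdown seen as two 16-bit halves. */
struct sim_timer {
    uint32_t count;
};

/* Every simulated register access lets one tick go by. */
static uint16_t sim_read_hi(struct sim_timer *t)
{
    uint16_t v = (uint16_t)(t->count >> 16);
    t->count--;
    return v;
}

static uint16_t sim_read_lo(struct sim_timer *t)
{
    uint16_t v = (uint16_t)(t->count & 0xffffu);
    t->count--;
    return v;
}

/* Read both halves; retry if the high half changed during the read. */
static uint32_t timer32_read(struct sim_timer *t)
{
    uint16_t hi, lo;

    do {
        hi = sim_read_hi(t);
        lo = sim_read_lo(t);
    } while (hi != sim_read_hi(t));
    return ((uint32_t)hi << 16) | lo;
}

/* Busy-wait at least usec microseconds; returns ticks actually waited. */
static uint32_t delay_us(struct sim_timer *t, uint32_t usec)
{
    uint32_t ticks = (usec + TICK_US - 1) / TICK_US;   /* round up */
    uint32_t start = timer32_read(t);
    uint32_t waited;

    do {
        waited = start - timer32_read(t);   /* countdown: start >= now */
    } while (waited < ticks);
    return waited;
}
```

   Unlike the "while (--count);" loop, the wait here is bounded by
   elapsed timer ticks, so cache hits or misses in the loop body only
   change how often the timer is polled, not how long the delay lasts.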

7. Conditional MMU code should be restructured.
   Right now it reflects the evolutionary path of the code: 320/350 MMU
   was supported and PMMU support was glued on.  The latter can be ifdef'ed
   out when not needed, but not all of the former (e.g. ``mmutype'' tests).
   Also, the PMMU is made to look like the HP MMU, somewhat hamstringing it.
   Since HP MMU models are dead, the excess baggage should be on the HP
   MMU side (though it could be argued that they benefit more from the
   minor performance impact).  MMU code should probably not be ifdef'ed
   on model type, but rather on more relevant tags (e.g. MMU_HP, MMU_MOTO).
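   One way this might look, as a sketch only.  The MMU_HP/MMU_MOTO tags
   come from the suggestion above; IS_HP_MMU(), the MMU_KIND_* values
   and mmu_flush_style() are invented for illustration and do not
   reflect the real mmutype encoding.  The point is that a kernel
   configured for one MMU family turns the run-time test into a
   constant, so the dead branch can be dropped by the compiler.

```c
/* Hypothetical config tags, normally set by the kernel config file. */
#define MMU_MOTO 1
/* #define MMU_HP 1 */

/* Illustrative values only; not the real mmutype encoding. */
enum { MMU_KIND_HP = 0, MMU_KIND_MOTO = 1 };

int mmutype = MMU_KIND_MOTO;    /* stands in for the boot-time probe */

#if defined(MMU_HP) && defined(MMU_MOTO)
#define IS_HP_MMU()  (mmutype == MMU_KIND_HP)  /* both built in: test at run time */
#elif defined(MMU_HP)
#define IS_HP_MMU()  1                          /* HP MMU only */
#else
#define IS_HP_MMU()  0                          /* HP MMU support compiled out */
#endif

/* Stand-in for any routine that currently tests mmutype inline. */
static const char *mmu_flush_style(void)
{
    if (IS_HP_MMU())
        return "hp";
    return "moto";
}
```

   With only MMU_MOTO defined, as here, mmu_flush_style() always takes
   the "moto" path and no mmutype test survives in the generated code.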

8. Redo cache handling.
   There are way too many routines which are specific to particular
   cache types.  We should be able to come up with a more coherent
   scheme (though HP 68k boxes have just about every caching scheme
   imaginable: internal/external, physical/virtual, writeback/writethrough).
   See, for example, Wheeler and Bershad in ASPLOS 92.  For more efficient
   handling of physical caches see also Kessler and Hill in Nov. 92 TOCS.

9. Sort the free page list.
   The DMA hardware on the 300 cannot do scatter/gather IO.  For example,
   if an 8k system buffer consists of two non-contiguous physical pages
   it will require two DMA transfers (and hence two interrupts) to do the
   operation.  It would take only one transfer if they were physically
   contiguous.  By keeping the free list ordered we could potentially
   allocate contiguous pages and reduce the number of interrupts.  We can
   consider doing this since pages in the free list are not reclaimed and
   thus we don't have to worry about distorting any LRU behavior.
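   The arithmetic can be sketched in C as follows.  This is a toy
   model under stated assumptions: pages are identified by page frame
   number, the free list is a plain ascending array rather than the
   kernel's real queue, and dma_transfers() and
   freelist_insert_sorted() are hypothetical names.

```c
#include <stddef.h>

/*
 * Number of DMA transfers a buffer needs without scatter/gather:
 * one per run of physically contiguous page frames.
 */
static size_t dma_transfers(const unsigned long *pfn, size_t npages)
{
    size_t runs = 0;

    for (size_t i = 0; i < npages; i++)
        if (i == 0 || pfn[i] != pfn[i - 1] + 1)
            runs++;
    return runs;
}

/*
 * Insert a freed page frame into an ascending free list of n
 * entries (the array must have room for n + 1).  Returns the new
 * length.  Neighbors in the list are then candidates for a
 * contiguous allocation.
 */
static size_t freelist_insert_sorted(unsigned long *list, size_t n,
                                     unsigned long pfn)
{
    size_t i = n;

    while (i > 0 && list[i - 1] > pfn) {
        list[i] = list[i - 1];   /* shift larger entries up */
        i--;
    }
    list[i] = pfn;
    return n + 1;
}
```

   For the 8k example above, a buffer on frames { 7, 8 } needs one
   transfer while one on frames { 7, 12 } needs two.  The cost of the
   ordered list is the linear insert on every free; whether that beats
   the saved interrupts would have to be measured.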
----
Mike Hibler
University of Utah CSS group
mike@cs.utah.edu